Preface


“Science longue patience” 
Louis Aragon 

This volume presents contributed papers of the Colloquium in Honor of Alain 
Lecomte, held in Pauillac, France, November 2-3, 2007¹. This event was part of
the ANR project Prelude². The selected papers belong to the numerous scientific
areas in which Alain has worked and to which he has contributed: formal
linguistics, computational linguistics, logic and cognition. Being able to work in 
and across such diverse and active areas requires a bird’s eye view from high 
above. So it might have been predestination that Alain was born in Le Bourget, 
the former north airport of Paris, in 1947. His father was in fact a mechanic 
for the Aeropostale and for Air France thereafter, possibly also explaining why 
Alain is so keen on plane noise, the smell of kerosene and travelling. 

After studying in the high school of Drancy, he undertook studies in mathematics
in what was to become Jussieu. His interest in philosophy also led him
to attend Althusser’s seminar. After enjoying May 1968 in Paris and spending a
few months in Copenhagen, in 1969 he obtained a master’s degree in statistics in
Grenoble where he became lecturer (assistant) at the IMSS (Institute of Mathematics
for Social Sciences). Resisting the call of the mountains surrounding
Grenoble, Alain got more and more interested in linguistics: he was preparing
for a PhD under the supervision of Jacques Rouault, who was leading a team
called “Traitement automatique des langues et applications” (Natural Language
Processing and Applications), which grew out of the former “Centre d’etude pour la
traduction automatique” (Center for Research on Automated Translation) led by
Bernard Vauquois. Alain defended his PhD in applied mathematics entitled “Essai
de formalisation des operations linguistiques de predication” in 1974. Thereafter
he spent two years teaching statistics in Oran, and returned to Grenoble at the
end of the 1970s.

Jacques Rouault knew Michel Pecheux because they were both following 
Antoine Culioli’s approach to the formalization of linguistics. That is how Alain
joined the research project conducted by Michel Pecheux: RCP Adela (Recherche
Cooperative Programmee, Analyse du Discours Et Lecture d’Archive) in 1980.
Within this project, Alain, with Jacqueline Leon and Jean-Marie Marandin, 
focused on the incremental analysis of discourse. He was already involved in 
the logical aspects of such issues. The project also needed a parser and Michel 
Pecheux got in touch with Pierre Plante from Montreal who was developing 
one in Lisp, Deredec. A research program with UQAM (Universite du Quebec
a Montreal) was launched, involving Alain, Jacqueline Leon and Jean-Marie
Marandin, among others, on the French side.


1 http://www.loria.fr/ pogodall/AL/

2 http://www.anr-prelude.fr/

Corresponding to this interest in formalizing linguistic phenomena, the first 
contribution of this volume, by Claire Beyssade, provides a linguistic analysis 
of bare nouns in French. The main claim of this paper is that interpretative 
differences between bare nouns and indefinite nouns in predicate position in 
French derive from a difference between two types of judgements: the attributive 
ones and the identificational ones. Such a claim relies on an analysis of copular 
sentences for which the logical forms differ on whether they are built with bare 
nouns or indefinite noun phrases. 

The projects on automatic analysis of discourse (AAD) and the Adela team
came to an end soon after the death of Michel Pecheux in 1984. By that time, Jean-Marie
Marandin had introduced Alain to Gabriel Bes (GRIL, Clermont-Ferrand).
The GRIL was involved in a European project, DYANA (Dynamic Interpretation
of Natural Language; DYANA and DYANA-2 lasted from 1988 to 1995), which
included a large part on categorial grammars, with people from Edinburgh (such
as Claire Gardent, formerly a member of the GRIL, or Glyn Morrill) and Utrecht
(such as Michael Moortgat). The GRIL was looking for someone comfortable
with the mathematical aspects of categorial grammars and Alain thus joined it.
There, he familiarized himself with these grammars, which would occupy him for
several years. In 1990, he organized a DYANA workshop at Clermont-Ferrand
and in 1992 he edited Word Order in Categorial Grammar, a collection of articles
on categorial grammar deriving from this workshop.

Because of his interest in the formalization and modelling of linguistic phenomena,
in particular from the logical point of view, Alain could not miss what is
now a well-established and continuously exciting event: the European Summer
School of Logic, Language, and Information, ESSLLI. Indeed, Alain attended
ESSLLI in 1990 and since then he has attended, lectured, or presented a paper
at most ESSLLI venues: 1990, 1991, 1992, 1995, 1997, 1998, 1999, 2000,
2002, 2005, 2006, 2007, 2008, 2009 and 2010.

His interest in logic for natural language led him to meet Christian Retore
(co-editor of this volume) in 1993. Because Alain was looking for non-commutative
versions of linear logic, Vincent Danos suggested he get in touch with Christian
who had just obtained a PhD degree with Jean-Yves Girard on partially ordered
sequents and the “before” connective (pomset logic). Alain showed Christian how
useful non-commutative calculi are for linguistics. Although they never worked
in the same location, this was the starting point of a fruitful collaboration.
At this time, while teaching in Grenoble, Alain was part of the GRIL and in
1994, he defended his habilitation thesis entitled “Modeles logiques en theorie
linguistique” in front of a jury consisting of Michele Abrusci, Gabriel Bes,
Jean-Pierre Descles, Michel Eytan and Michael Moortgat. When Christian moved to
Nancy, in October 1994, Alain played an important role in the creation of the
INRIA Project team Calligramme, led by Philippe de Groote, soon joined by
François Lamarche and Guy Perrier. The theoretical center of the team was
linear logic, especially proof-nets, and the application areas were concurrency
but also, and soon mainly, computational linguistics.

One of the key ideas Alain and Christian had was to map words not to 
formulas as in categorial grammar, but to partial proof-nets (of Lambek calculus
or of pomset logic), and to view parsing as assembling partial proof-nets into
a complete correct proof. They presented their work at the Roma workshops
organized by Michele Abrusci and Claudia Casadio on Lambek calculus, linear 
logic and linguistic applications, or to Formal Grammar around ESSLLI venues. 
In 1995, Alain became professor in epistemology, philosophy of sciences and logic 
in Grenoble and was the dean of his faculty for several years. 

Taking partial proof-nets as lexical entries gave a rather broad notion of grammar,
encompassing rich syntactic formalisms, e.g., TAG, but more symmetrical. This
approach permitted one to model discontinuous constituents or relatively free
word order while sticking to the idea of “proof as analysis.” As in categorial
grammar, semantic readings could be recovered from this proof in a
Montague-like style. Alain and Christian also thought of inverting this process, and wrote
a PhD proposal with Marc Dymetman on this topic: their common student
Sylvain Pogodalla (co-editor of this volume) made a contribution to this topic
in the more standard framework of Lambek grammars: “Reseaux de preuves et
generations pour les grammaires de type logique” (defended in 2001 in Nancy).

In the present volume, the second paper also pertains to the logical view of 
language. It aims at reconstructing the Cooper storage method for quantifiers 
within a type-theoretic framework: convergent grammar (CVG). In this paper, 
Carl Pollard motivates the CVG framework by analyzing the evolutions of Chomsky’s
transformational model through the minimalist program and by comparing
it with the categorial grammar approaches. This leads to considering syntactic
trees as proof trees. But, contrary to standard categorial grammar, the semantic
terms do not directly result from the syntactic terms. They are instead built in
parallel using a purely derivational calculus for the syntax-semantics interface. 

While the CVG framework itself does not consist in a rephrasing of the Minimalist
Program, it highlights some of its relations to type theory and logical
grammar. It interestingly echoes the fact that at the beginning of the 2000s, because of
Alain’s knowledge about generative grammar and Noam Chomsky’s work, and
because of Christian’s interest in this formalism, Alain gave a series of seminars
on this topic in Nancy, resulting in joint work on a categorial treatment of
minimalism, and in particular of Ed Stabler’s minimalist grammars (MG) first
presented at Logical Aspects of Computational Linguistics (LACL), a conference
launched by Alain and Christian in 1996. The point of giving a categorial
view of minimalism was to provide it with better semantic representations.
First, the proof expresses valency consumption of linguistic resources, but then
a process computes from the proof either word order or semantic representation.
This topic was an important one in the INRIA group Signes that Christian
started in Bordeaux in September 2002 and in which Alain actively took
part. In addition to joint papers on that subject, they also co-advised two PhD
students: Maxime Amblard, “Calculs de representations semantiques et syntaxe
generative: les grammaires minimalistes categorielles,” and Houda Anoun,
“Approche logique des grammaires pour les langues naturelles,” both defended in
Bordeaux in 2007.

Witnessing the activity in this area, the next three papers elaborate on various 
aspects of bridging the minimalist program and categorial grammars. In the first 
one, Richard Moot builds on proof nets for Lambek grammars to give a graphical 
perspective on two different categorial grammar-based accounts of the move 
operation of the Minimalist Program. This new perspective allows him to unify 
the two accounts and to overcome some of their drawbacks, lexicalizing subparts 
of the proof trees corresponding to syntactic trees where a move operation has 
been triggered. This also makes a very interesting link to the syntactic recipes 
automatically inferred for categorial grammars on large corpora. 

Maxime Amblard’s contribution deals with a type-theoretic formulation of 
minimalist grammars: the minimalist categorial grammars (MCG). This paper is 
a step in a more general program: to provide MG with a theoretically motivated 
syntax/semantics interface. It focuses on the first step of the proof of mutual 
inclusion of languages between MG and MCG. This paper gives an original 
definition of minimalist grammars based on an algebraic description of trees 
which allows one to check properties of this framework and which provides a 
description suitable for comparison with generated languages of frameworks. 

The next contribution, by Sylvain Salvati, also provides an original view of 
MG. This perspective, which relies on reinterpretations of MG in light of logic, is 
innovative and quite ambitious, and could also be applied to other grammatical 
formalisms. A first interpretation of MG within abstract categorial grammars is 
given in which MG derivations are represented as terms. The latter can then be 
interpreted either as syntactic trees or strings, as usual, or interpreted as semantic
terms, providing MG with another syntax-semantics interface. It also shows
that the membership problem for MG is at least as difficult as multiplicative 
exponential linear logic provability. Finally, it also presents a monadic second 
order-based interpretation of MG derivations, providing them with a descriptive 
rather than rule-based interpretation. 

A very important characteristic of natural language lies in its learnability.
Taking this property into account is a very interesting feature of MP. This property
has also been studied in the framework of type-logical grammars, leading
to various results. In this volume, Isabelle Tellier and Daniel Dudau-Sofronie
present learnability results for some class of Lambek grammars. Their approach builds on
adding semantic information, namely types, to the available syntactic information
under some condition of compositionality.

While these kinds of properties are established on formal and logical systems, 
this work raises interesting questions on the cognitive abilities exhibited by the 
language faculty. Alain has also contributed to this area. In 2006, when he decided
to move from the University of Grenoble to Paris 8 in order to enjoy
a more linguistically oriented department, he started to collaborate with Pierre
Pica on the Mundurucu numeral system. In addition to this work and while
pursuing his research on categorial minimalist grammars, Alain also became
seriously interested in semantics and pragmatics and also in ludics.

At Geocal-Marseille 2006, Alain met Marie-Renee Fleury and Myriam Quatrini
(co-editor of this volume), who organized a workshop on Computational
Linguistics and Linear Logic. All three were convinced that ludics, a theory of 
logic conceived by Jean-Yves Girard and presented as a “Theory of Interaction,” 
was a relevant framework to formalize pragmatics, and they decided to explore 
such a possibility. Alain then launched the national research program on this 
issue, Prelude, that has been renewed for the forthcoming years as Loci. Within 
this program, he has specifically worked on dialogue modelling in ludics with 
Marie-Renee Fleury, Myriam Quatrini and Samuel Trongon. Based on their use
of ludics to represent dialogue and pragmatics, in collaboration with Myriam 
Quatrini, Alain has proposed a new framework for semantics, extending in the 
ludical framework the intuitionistic idea according to which the semantics of a 
formula (utterance) is the set of its proofs (justifications). Moreover, the ludical 
model of semantics that they propose has lots of links with the game semantics 
tradition. 

The last contribution of this volume, by Marie-Renee Fleury, Myriam Quatrini
and Samuel Trongon, gives a formal model for dialogues based on ludics. They
show how certain notions of dialogue relate to some fundamental concepts of ludics.
In particular, players of a dialogue are not confined within the rigid rules
of formal logic but instead can explore a space where a partial, incomplete or 
even “incorrect” proof can occur. This exploration makes use of the notion of 
interaction which is central both in ludics and in dialogue. 

While this (non-exhaustive) chronology of Alain’s work makes it sound as 
though he could legitimately retire and stop contributing to this multidisciplinary 
area of logic and linguistics, we are quite confident, given how young in spirit 
and fit he is, that the aforementioned results are just the beginning and that 
Alain’s best scientific contributions are still to come! 

We wish to thank Catherine Pequenat, Alain’s wife, for providing us with
some biographical details. We are also very grateful to Jacqueline Leon and
Jean-Marie Marandin for their help in setting the scientific chronology of Alain’s career
and to Nicholas Asher for his stylistic advice. And we are of course responsible 
for any remaining mistakes. 


January 2011 


Sylvain Pogodalla 
Myriam Quatrini 
Christian Retore 
Bare Nouns in Predicate Position in French


Claire Beyssade 


Institut Jean Nicod, CNRS-ENS-EHESS, 29, rue d'Ulm,
75005 Paris, France 

claire.beyssade@ehess.fr


Abstract. In this paper we examine the differences between bare singular nouns 
and indefinite singular NPs in predicate position in French. Our claim is that the 
semantic value of the singular indefinite determiner is not empty in French and 
that various interpretative contrasts between bare singular nouns and indefinite 
nouns in predicate position can be accounted for if a distinction between two 
rules of predication supported by copular sentences is introduced. We assume 
that bare nouns denote properties, which can be attributed to individuals, while 
indefinite noun phrases denote entities, which can be identified with an 
individual in context. This distinction between two types of statements, 
attributive ones and identificational ones, takes its source in Higgins' typology,
and will be compared with Roy’s and Heller and Wolter’s works on predicative 
and specificational sentences. 

Keywords: indefinite, bare noun, copular sentence, property. 


1 Introduction 

It is frequently assumed that French doesn't allow productive bare nouns, either in
argument positions or in predicate positions. Such an assumption is in contradiction
with the following data, some of which are well-known and studied in the recent
literature. (1a) corresponds to autonymic uses of bare nouns, (1b-c) to bare nouns in
coordination (cf Roodenburg, 2004), (1d) to bare nouns in compound verbs, (1e) to
bare nouns in prepositional phrases, and (1f-h) exemplify the uses of bare nouns in
negative polarity contexts. Such examples need to be explained, and comparison
seems to play a crucial role in (1g-h). (1i-j) illustrate some occurrences of bare nouns
in phrases of the type N prep N, (1k-m) present some cases of bare nouns which are
not often mentioned in linguistic studies, and (1n-o) show that bare nouns may appear
in copular sentences.

(1) a. Cordiste est un metier d'avenir qui demande une double competence, 
sportive et technique. 

[A] harnessed climber is an up-and-coming trade which requires both
technical and athletic skills 1 


1 We indicate with square brackets [] words which are required in the English translation, but 
are missing in the French sentence. 
b. Livres et journaux jonchaient le sol.

Books and newspapers lay everywhere on the ground

c. Il faut disposer d'un compte bancaire pour toucher salaire ou retraite.

You need to have a bank account in order to receive [your] wages or pension

d. prendre rendez-vous, faire usage, avoir faim, avoir mal au coeur...

to make [an] appointment, to make use of, to be hungry, to feel sick

e. un pot en fer, un probleme de taille, une idee de genie ...

a pot made of metal, a problem with size, a genius’ idea

f. Jamais homme n'entra dans ce couvent.

No man has ever stepped in that convent

g. Je n'avais jamais vu femme si belle.

I’ve never seen such [a] beautiful woman

h. Vous ne trouverez pas hotel plus agreable dans la region.

You won’t find [a] more pleasant hotel in the area

i. Il a bu biere sur biere.

He drank beer after beer

j. Elle s'embellit jour apres jour.

She gets prettier day after day

k. Ce nouveau telephone fait appareil photo.

This new phone can be used as [a] camera

l. Cette explication fait sens.

This explanation makes sense

m. Si probleme il y a, n'hesite pas a me rappeler.

In case of problem, don’t hesitate to call me

n. Jean est ami avec le directeur.

Jean is [a] friend with the director

o. Jean est professeur.

Jean is [a] professor

In this paper, we won't analyze all of these configurations. We will only focus on 
bare nouns in copular sentences, and more specifically on the contrast between bare 
singular nouns and indefinite singular nouns in copular sentences. Our aim is to show 
that copular sentences built with indefinite singulars (IS) differ in crucial ways from 
sentences built with bare singulars (BS): we can explain the interpretative and 
distributional differences between IS and BS by analyzing copular sentences built 
with IS as relying on an identity relation rather than on predication. 

(2) a. Jean est un clown. Jean is a clown 

b. Jean est clown. Jean is [a] clown 

After a presentation in §2 of various interpretative and distributional contrasts
between IS and BS in copular sentences, we propose in §3 to revisit Higgins'
typology of copular sentences and to distinguish between predication and equation,
grouping copular sentences built with IS together with identificational, specificational
and equative sentences. Such an analysis allows us to explain both the alternation
between il and ce pronouns in copular sentences (§4) and the restrictions on modified
bare nouns (§5).


2 Contrasts between Bare Nouns and Indefinite Nouns in Copular 
Sentences 

In Semantics in Generative Grammar, Heim and Kratzer (1998:61) assume that "the 
indefinite article a is vacuous when it occurs in predicate nominals such as a cat in 
Kaline is a cat". This assumption, even if it were true for English, can’t be applied to
French, since, as (2) illustrates, copular sentences with and without indefinites may 
convey different meanings. (2b) means that John is a clown by profession, while (2a) 
just has a metaphoric meaning, and is adequate if John behaves as a clown, i.e. if the 
speaker judges that John is a funny person. This observation is due to Laca and 
Tasmovski (1994) and can be added to other contrasts which distinguish between BS 
and IS in copular sentences. 

First of all, ISs, contrary to BSs, are incompatible with small clauses (3). Secondly, 
ISs, contrary to BSs, cannot appear in sentences with interruptive or intermittent 
readings (4). Lastly, the difference between ISs and BSs correlates with different
pronouns in left dislocation (5). 

(3) a. Marie imagine Paul ministre. 

b. *Marie imagine Paul un ministre. 

Mary imagines Paul a minister 

(4) a. Paul est medecin le jour, chanteur la nuit. 

a'. * Paul est un medecin le jour, un chanteur la nuit. 

Paul is a doctor during the day, a singer during the night 

b. Paul est traducteur a ses heures libres. 

b'. * Paul est un traducteur a ses heures libres. 

Paul is a translator after hours 

c. Paul a ete professeur a trois occasions dans sa vie. 

c'. * Paul a ete un professeur a trois occasions dans sa vie. 

Paul has been a teacher three times in his life 

(5) a. Paul, (il / *c') est traducteur.

b. Paul, (?il / c') est un traducteur.

Paul, (he / CE) is a translator 

All of these contrasts suggest a possible parallelism between bare nouns and
adjectives: like adjectives, bare nouns denote properties that can be attributed to a
stage of an individual and that can give rise to a temporary interpretation. On the
contrary, sentences built with indefinite singulars seem to convey a different 
meaning, where the indefinite nouns denote a stable property or more precisely 
denote an individual characterized by a permanent property. A comparison can be 
made between BS vs IS on the one hand, and adjective vs nominalization on the 
other hand, as in examples (6). 
(6) a. Jean est bossu (temporary property)

b. Jean est un bossu (permanent property)

Jean is (hunchbacked / a hunchback)

There are some other differences between IS and BS uses. BSs, contrary to ISs, can 
be followed by as qua expressions (7) and can be modified by a prepositional phrase 
including a superordinate of the noun (8). Furthermore, BSs and ISs don't trigger the
same type of inferences concerning the existence of the subject of the copular 
sentence. It is usual to consider that (9a) triggers the implicature that Pierre is dead, 
contrary to (9b), which doesn't trigger any implicature of this kind. Conversely, when 
one considers sentences built with present tense, the sentence with IS is not associated 
with any lifetime implicature, while the sentence with BS seems inappropriate when 
the subject denotes an individual who is dead, like in (10b) (cf Matushansky and 
Spector 2003, 2005). 

(7) a. En tant que medecin, Pierre n'a pas voulu prendre position sur ce sujet. 

b. * En tant qu'un medecin, Pierre n'a pas voulu prendre position sur ce 
sujet. 

As a doctor, Pierre didn’t want to take a stand on this subject 

(8) a. Pierre est avocat de profession. / * Pierre est un avocat de profession. 

b. Pierre est chretien de religion. / * Pierre est un chretien de religion. 

c. Pierre est français de nationalité. / * Pierre est un français de nationalité.

Pierre is a (lawyer / Christian / French) by (profession / religion / 
nationality) 

(9) a. Pierre etait un medecin. 

Pierre was a doctor 

b. Pierre etait medecin. Maintenant il est retraite. 

Pierre was [a] doctor. Now he is retired 

(10) a. Balzac est un ecrivain. 

Balzac is a writer 
b. ? Balzac est ecrivain. 

The last well-known contrasts between IS and BS concern the modification of the 
noun, which is much more restricted with BS than with IS. 

(11) a. Jean est un medecin (generaliste / honnete / qui a plus de 50 ans). 
b. Jean est medecin (generaliste / *honnete / * qui a plus de 50 ans). 

Jean is a general practitioner / an honest doctor / a doctor who is
more than 50 years old

These contrasts are not new, they are presented in different papers, and in particular 
in Roy (2006) or in de Swart et al. (2006, 2007). Here our aim is just to present 
them again in order to propose an analysis of copular sentences which can explain 
some of them. 
3 Two Types of Judgments: Predication vs. Equation

To account for these interpretative and distributional differences, we propose to 
introduce a distinction between different types of copular sentences. We focus on two 
classes of copular sentences, (i) copular sentences built with a bare noun and (ii) 
copular sentences built with an indefinite noun. We associate these copular sentences
with two distinct logical forms.

(12) a. Jean est clown 

b. [[DP is BS]] = 1 iff [[DP]] has P_BS
or in other words
iff P_BS ∈ [[DP]]

DP means 'determiner phrase', BS means 'bare singular noun' and P_BS refers to the
property denoted by the bare noun. Thus the subject DP in (12a) is analyzed as a 
generalized quantifier. 
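
The truth condition in (12b) can be illustrated with a minimal sketch, under assumptions that are not part of the paper: a generalized quantifier is encoded as a function from properties to truth values, so that "P_BS ∈ [[DP]]" amounts to applying [[DP]] to P_BS. The domain, the set CLOWNS and all function names below are hypothetical.

```python
# Hypothetical toy model (not from the paper): individuals and professional clowns.
DOMAIN = {"jean", "paul", "marie"}
CLOWNS = {"jean"}


def clown(x):
    """Property denoted by the bare noun 'clown': λx C(x)."""
    return x in CLOWNS


def proper_name(individual):
    """Generalized quantifier for a proper name: the set of properties the
    individual has, encoded as a function from properties to bool."""
    return lambda prop: prop(individual)


def predicational(dp, p_bs):
    """(12b): [[DP is BS]] = 1 iff P_BS ∈ [[DP]]."""
    return dp(p_bs)


print(predicational(proper_name("jean"), clown))  # True
print(predicational(proper_name("paul"), clown))  # False
```

Encoding the set of properties as a characteristic function is only one possible choice; it keeps the membership test in (12b) a single function application.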

(13) a. Jean est un clown 

b. [[DP is IS]] = 1 iff [[DP]] = [[IS]] 

IS means ’indefinite singular noun’. (13) conveys an identity statement, in which it 
is claimed that the DP in subject position and the postverbal IS have the same 
denotation. 

Another way to express the difference between these two types of copular 
sentences is to show that the copula in each of these sentences doesn't play the same 
role. In (12), the copula corresponds to λP P, while in (13) it corresponds to λy λx
(x=y). In (12), the bare noun is comparable to an adjective and denotes a property,
which is attributed to a subject. The copula is just used to bind or compose the subject 
and the post-copular noun. 

(14) is : λP P
clown : λx C(x)

is clown : λP (P) λx C(x) which reduces to λx C(x)

If we analyze any indefinite NP as a generalized quantifier, we obtain the 
following composition for a VP built with a copula and an IS such as 'est un clown'. 
The copula is viewed as expressing relative identity (cf Gupta 1980): λP λx (P) λy
(x=y).

(15) is : λP λx (P) λy (x=y)

a : λQ λR ∃z (Q(z) ∧ R(z))
clown : λx C(x)
a clown : λR ∃z (R(z) ∧ C(z))

is a clown : λP λx (P) λy (x=y) ( λR ∃z (R(z) ∧ C(z)) )
which reduces to

λx (λR ∃z (R(z) ∧ C(z))) λy (x=y)

and finally to

λx (∃z (x=z) ∧ C(z))
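
The two compositions in (14) and (15) can be checked mechanically with a small sketch. It assumes a finite toy domain over which the existential quantifier ranges; the names DOMAIN, CLOWNS, is_pred, is_ident and reduced are hypothetical and serve only as illustration.

```python
# Hypothetical toy model (not from the paper).
DOMAIN = ["jean", "marie", "paul"]
CLOWNS = {"jean"}

clown = lambda x: x in CLOWNS  # λx C(x)

# (14): predicational copula, λP P
is_pred = lambda P: P
is_clown = is_pred(clown)  # reduces to λx C(x)

# (15): identity copula, λP λx (P) λy (x=y)
is_ident = lambda P: lambda x: P(lambda y: x == y)
# Indefinite article, λQ λR ∃z (Q(z) ∧ R(z)), with ∃ ranging over DOMAIN
a = lambda Q: lambda R: any(Q(z) and R(z) for z in DOMAIN)
is_a_clown = is_ident(a(clown))

# Fully reduced form given in the text: ∃z with x=z and C(z)
reduced = lambda x: any(x == z and clown(z) for z in DOMAIN)

for x in DOMAIN:
    # The composed term and its reduced form agree on every individual.
    assert is_a_clown(x) == reduced(x)
    print(x, is_clown(x), is_a_clown(x))
```

In such an extensional toy model the two verb phrases are true of the same individuals. The sketch only illustrates how the two copulas compose with their complements, not the interpretative contrasts discussed in §2.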

The present proposal is distinct from what can be found in the literature, and in 
particular distinct from Higgins', Roy's, and Heller and Wolter's analyses, summarized 
in table 1. 


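Table 1. Typology of copular sentences

Higgins          | Roy                        | Heller and Wolter                 | Beyssade & Sorin
-----------------+----------------------------+-----------------------------------+----------------------
Predicational    | i) characterizing          | i) ordinary predication           | Predicational
                 | ii) defining               | ii) quiddity predication          |
                 | iii) situation-descriptive |     (including identificationals) |
-----------------+----------------------------+-----------------------------------+----------------------
Equative         | Equative                   | Equative                          | Identity: equatives,
Identificational | Identificational           |                                   | identificationals,
Specificational  | Specificational            | Specificational                   | specificationals
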

The important point is that here we draw a demarcation line between predicational 
and non predicational copular sentences, and that we analyze copular sentences built 
with bare nouns as predicational sentences, while copular sentences built with 
indefinite nouns are viewed as instances of non predicational sentences. Let's note
that in all the other proposals found in the literature no clear distinction is established 
between copular sentences built with IS vs with BS. 

In Higgins' typology, IS copular sentences may be predicational or non
predicational sentences, according to the context. 

In Roy's thesis, copular sentences, either built with IS or with BS, are viewed as 
cases of predication: defining predicates are expressed by ISs, characterizing 
predicates are expressed by BSs, and situation-descriptive predicates are expressed by 
adjectives. It should follow from such an analysis that small clauses would be 
compatible, both with IS and with BS, which is not the case. 

(16) Marie imagine Paul (0 / *un) ministre. 

Mary imagines Paul (0 / a) minister

And finally, Heller and Wolter propose to introduce a distinction between two 
types of predicates: (i) predicates which express ordinary predication, and (ii) what 
they call quiddity predicates, which provide an answer to the question ‘What is
that?’. In addition to expressing a property of the entity, quiddity predicates tell us 
something about the essence or nature of the entity. They don't give any strong 
argument to justify why they analyze quiddity predicates as instances of predication 
and not as instances of equatives or specificationals. We have observed that in French, 
ordinary properties, which express secondary properties, are typically expressed via 
BS and the verb faire (cf (17a)) while quiddity predicates are expressed via IS 
(cf (17b)). Consequently, we will analyze quiddity sentences as cases of non
predicational sentences. The sentence (17a) is not about a camera, but rather about an 
object that has a secondary function as a camera. 

(17) a. Ce telephone fait appareil photo. (ordinary property) 

This cell phone is a camera. 
b. C'est un appareil photo. (quiddity) 

This is a camera. 


4 Alternation between il and ce in French Copular Sentences 

It is well-known (cf a.o. Kupferman 1979, Tamba 1983, Boone 1987, Beyssade & 
Sorin 2005) that there is an alternation between pronouns ce / il in French left 
dislocation constructions, which depends on the post-copular element: the pronoun ce 
‘that’ appears when the post-copular phrase is an IS, while il / elle 'he / she' is used 
with a BS. 

(18) a. Jean, c’est (un chanteur / *chanteur). 

John, CE is (a singer / singer) 
b. Jean, il est (? un chanteur/chanteur). 

John, il is (a singer / singer) 

We observe that BSs behave as adjectives in this type of configuration, while 
copular sentences built with IS can be grouped with equative, identificational and 
specificational sentences since the pronoun which appears in left dislocation 
constructions is ce and not il /elle. 

(19) Jean, (il/* c’) est beau. 

John, (he / CE) is beautiful 

(20) a. Clark Kent, (c' / ?il) est Superman. equative

b. Ça, (c' / *il) est John. identificational

c. Le probleme, (c' / *il) est John. specificational

This observation provides a new argument to analyze copular sentences with BS as 
predicational sentences and to group copular sentences built with IS with equative, 
identificational and specificational sentences in the class of identity statements. 

What does the use of ce vs il/elle indicate? Let us recall that pronouns may be used
to give some information about the denotation of their antecedents. According to
Blanche-Benveniste (1990, 41-42), "pronouns give a grammatical description which
may be more fine-grained than lexical words". For instance, the difference between
ils and ça in (21) has to do with the type of denotation of the noun phrase les chiens.
In (21a), the pronoun refers to a particular set of dogs, while in (21b), it refers to the
kind 'dog', and the sentence is a generic one.

(21) a. Les chiens, ils aboient.

The-dogs-they-are-barking The dogs are barking

b. Les chiens, ça aboie.

The-dogs-ÇA-barks Dogs bark

Furthermore, certain pronouns are sensitive to the difference between singular and
plural, while others (for instance the French generic pronoun ça) are not. The same
holds for the pronoun le when it is anaphoric to a proposition, as in (22b): it does not
vary, whether it refers to one or several propositions.

(22) a. (Le chien / Les chiens), ça aboie.

(The-dog / The dogs)-ÇA-barks Dogs bark

b. Jean est poète. Marie est danseuse. Tout le monde (le / *les) sait.

John is a poet. Mary is a dancer. Everybody (LE-sg / LES-pl) knows. 

We can observe that the opposition [+Human] / [-Human] is not relevant here for
distinguishing ce vs il/elle, since in the sentences which we are interested in, every
subject NP refers to a human. Moreover, il / elle can be anaphoric to non-human
noun phrases, as in (23):

(23) La soupe, elle est trop chaude. 

The-soup-ELLE-is-too-hot 

In our view, the relevant difference between ce and il/elle has to do with the
type of denotation. Contrary to il/elle, which refers to an entity which is identified and
can be type-shifted to a set of properties, ce refers to an entity without identity. In
other terms, the reference of ce is not strong, but weak (Dummett 1973, 1981),
exactly as indefinite noun phrases can be weak, when they are incorporated (cf van
Geenhoven 1996, McNally and van Geenhoven 1998) or when they appear in
presentational sentences (McNally 1997, Moltmann 2007). Dummett suggests that
this and that in English refer to pre-individuated portions of reality, and thus involve
reference without identity. They involve indeterminate reference that leaves open
what entity exactly is being referred to. Our proposal is that ce, contrary to il/elle, has
weak reference, and thus cannot be type-shifted from type e to type ((e,t),t). This is why
ce can appear in identity sentences, but not in predicational sentences. Il / elle may
refer to an individual of type e, which can be type-shifted to a set of properties (i.e. type
((e,t),t)).

(24) Ce can only denote entities (type e). It cannot be type-lifted to denote sets of 
properties. 
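
The type-shift that (24) denies to ce can be made concrete with a small sketch. It is only an illustration under our own assumptions: the toy model (the set SINGERS) and the function names lift and identity are hypothetical, and the identity statement is simplified to equality between two type-e denotations.

```python
from typing import Callable

Entity = str
Property = Callable[[Entity], bool]       # type (e,t)
Quantifier = Callable[[Property], bool]   # type ((e,t),t)

# Hypothetical toy model: the set of singers.
SINGERS = {"jean"}

def chanteur(x: Entity) -> bool:
    """Bare noun 'chanteur' read as a property of type (e,t)."""
    return x in SINGERS

def lift(x: Entity) -> Quantifier:
    """Type-shift an identified entity x to the set of its properties: λP. P(x)."""
    return lambda prop: prop(x)

def identity(a: Entity, b: Entity) -> bool:
    """Identity statement between two type-e denotations (a simplification)."""
    return a == b

# il/elle: an identified entity, so it may be lifted and combined with a property.
il = lift("jean")
predicational = il(chanteur)              # 'Jean, il est chanteur'

# ce: stays at type e, so only an identity statement is available.
ce = "jean"
identificational = identity(ce, "jean")
```

In this sketch nothing prevents a programmer from calling lift on the denotation of ce; the restriction in (24) is a claim about the grammar, which the code does not enforce.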

Ce is grammatical in identity copular sentences such as Jean, c'est un chanteur,
because such a sentence doesn't rely on predication, but rather on identity. Conversely,
ce is ungrammatical in predicational copular sentences such as Jean, c'est chanteur,
because ce doesn't denote a generalized quantifier: it has weak reference, and can't
be type-shifted from type e to type ((e,t),t).


5 Modified Bare Nouns in Predicate Positions

We will focus here on the class of expressions that can occupy the post-copular
position without a determiner. It has been observed that in Romance languages only a
restricted class of common nouns can be used without any determiner (a.o.,
Kupferman 1979, Pollock 1983, Boone 1987, Laca and Tasmowski 1994, Roy 2006,
Matushansky and Spector 2003, de Swart et al. 2007). This class includes
professions, titles and functions:

(25) Professions (médecin 'doctor', avocat 'lawyer'...), titles (prince 'prince', baron
'baron', roi 'king'...), hobbies (chasseur 'hunter', alpiniste 'climber'...),
functions (président 'president', ministre 'minister', sénateur 'senator'...),
status (étudiant 'student', SDF 'homeless'...)

This claim has recently been challenged by Mari and Martin (2008), who argue that
basically every common noun can be used bare in copular sentences, and propose to
give a unified analysis for sentences such as (26a) and (26b).

(26) a. Marie est docteur. 

Mary is [a] doctor 

b. Marie est voiture /salade /mini-jupe. 

Mary-is (-car/-salad/-mini-skirt) 

Mary is the (car / salad / mini-skirt) type 

Even if it is interesting to bring examples like (26b) into the picture, it seems to us that
it isn't justified to associate the same type of logical form with (26a) and (26b). Such an
analysis blurs important contrasts between (26a) and (26b), as illustrated by (27). While
(26b) may be paraphrased by (27d), there is no equivalence between (26a) and (27c).

(27) a. Marie est très (*docteur / voiture / salade / mini-jupe).

Mary-is-very (doctor / car / salad / mini-skirt) 

b. Marie est (un docteur / * une voiture / * une salade/ * une mini-jupe). 
Mary-is- (a-doctor / a-car / a-salad / a-mini-skirt) 

c. Marie aime les docteurs. 

Mary likes doctors 

d. Marie aime la voiture / la salade / les mini-jupes. 

Mary-likes- the(sg)-car / the(sg)-salad / the(pl)-mini-skirts 

Finally, while (26a) is completely unmarked, examples like (26b) are marked and need some
context to be interpreted as well-formed. This is why we won't consider
examples of the type of (26b) in this paper. They deserve a separate study.

Our aim in this part of the paper isn't to propose a new characterization of the class
of nouns which can be bare in copular sentences. We consider that the description
proposed in Beyssade and Sorin (2005) is on the right track. They claim that these
nouns refer to non-sortal properties, i.e. to properties that are associated with a
principle of application but no principle of identity (cf Gupta 1980). According to
Gupta, the principle of identity is supplied only when the common noun is combined with a
determiner or a quantifier. When a common noun is used bare, it refers to a role, a
guise, which doesn't directly refer to an individual, but to a secondary property of the
object. Consequently, bare common nouns are comparable to adjectives: they refer to
secondary properties that are not associated with principles of identity.

What we want to study here is the variety of expressions that can appear in post-copular
position without a determiner. It seems that, besides names of roles, there are a
number of more complex expressions that can be used without a determiner, as
illustrated in (28):

(28) a. Jean est (professeur de mathématiques / père de trois enfants).

John is [a] (professor of Mathematics / [the] father of three children) 

b. Jean est fils d'avocat. 

John is [the] son of a lawyer 

c. Jean est bon danseur. 

John is [a] good dancer 

In each case, a property is attributed to an individual, but the way the complex
property is built from a simple one differs. We describe here three
different possibilities; the list may not be exhaustive.

A first type of complex property is shown in (28a), where the noun of guise is
modified by another noun preceded by a functional preposition. Professeur de
mathématiques defines a subtype of professor, and père de trois enfants is a subtype
of father. It is important to note that (28a) implies that John is a professor, or that John
is a father.

Another way to build a complex noun of guise is to use a noun of guise as the
argument of a relational noun, and more specifically a kinship noun such as son,
wife... Fils d'avocat and femme de ministre denote complex properties.

(29) a. Jean est fils d'avocat. 

Jean is [a] son of lawyer 
a'. Jean est (*∅ / le) fils (de Marie / d'un avocat).

Jean is (∅ / the) son (of Mary / of a lawyer)
b. Marie est femme de ministre.

Marie is [a] minister's wife
b'. Marie est (*∅ / la) femme du ministre.

Marie is (∅ / the) wife of the minister

We assume that kinship nouns such as fils, femme may denote a relation not only
between two individuals but also between an individual and a property (role/guise).
Correspondingly, fils d'avocat in (29) denotes a complex property, obtained by
applying a function (the word fils is represented in (30a) by λP λx fils (x, P)) to a
property (avocat). (30a) can be reduced to (30b), which shows that fils d'avocat
denotes a property. It is interesting to note that if the argument of the relational noun
does not denote a guise, but an individual (cf (29a') and (29b')), then the complex
expression cannot be used bare, but needs to be preceded by a determiner.

(30) a. λP λx fils (x, P) (avocat)
b. λx fils (x, avocat)
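
The reduction from (30a) to (30b) is ordinary function application, which can be sketched as follows. The toy model (ROLE, PARENT_OF) is hypothetical, and so is the way we spell out fils (x, P) here, namely as 'x is the child of someone who has the role P'; the text itself leaves the content of the relation open.

```python
# Hypothetical toy model: who holds which role, and who is whose parent.
ROLE = {"avocat": {"paul"}, "ministre": {"anne"}}
PARENT_OF = {("paul", "jean"), ("anne", "luc")}   # (parent, child) pairs

def fils(P):
    """λP λx. fils(x, P), read here as: x is the child of someone with role P."""
    def prop(x):
        return any(parent in ROLE[P]
                   for (parent, child) in PARENT_OF
                   if child == x)
    return prop

# (30a): apply the relational noun to the guise 'avocat' ...
fils_d_avocat = fils("avocat")
# ... which yields (30b), a one-place property that can be predicated of Jean.
jean_est_fils_d_avocat = fils_d_avocat("jean")
```

The result of fils("avocat") is a one-place function, i.e. a property, which is what allows fils d'avocat to appear bare after the copula on this analysis.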

The third way of building a complex property consists in modifying an adjective
by a noun of guise. We claim that bare nouns can be taken to denote both a property
and a property modifier. This idea is borrowed from Fox (2000), who has proposed to
formalize property modifiers in the framework of Property Theory. In example (28c),
which illustrates this type of case, the noun of guise is analyzed as a property
modifier rather than as a property: the property attributed to Jean is not the property of
being a dancer, but the property of being good, as a dancer.

This analysis presents several advantages compared to some other recent proposals
(Larson 1998, de Swart et al. 2007).

First, our analysis predicts that there is no entailment from (31a) to (31b): danseur
in (31a) does not have the restricted meaning of professional dancer, contrary to what
happens with bare nouns. Indeed, the property attributed to the subject in (31a) is not
the property of being a dancer, but the property of being good.

(31) a. Jean est bon danseur. 

John is [a] good dancer 
b. Jean est danseur de profession. 

John is [a] dancer by profession 

It is only when they denote a property that BNs have the restricted meaning of
capacity. In all other contexts, they have an underspecified meaning: (31a) can be
understood as meaning 'Jean is beautiful when he dances', and not necessarily as 'Jean
is a professional dancer who dances beautifully'. According to our proposal, a noun
like danseur is semantically ambiguous in French: it can denote a property, or a
property modifier.

(32) a. danseur as a property: D or λx D(x)

b. danseur as a property modifier: λP λx ∀s (D(x,s) → P(x,s))

(32b) expresses the fact that danseur can modify a property P and yield another
property. This new property can be attributed to an individual x if and only if, in every
situation s where x has the property D (i.e. when x dances, as a professional dancer or
not), x also has the property P. Thus (28c) can be analyzed as a predicative sentence
in which bon danseur denotes a complex property that is attributed to Jean.
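
The property modifier in (32b) can be evaluated over a small finite model, as in the following sketch. The situations and the facts in DANCES and GOOD are invented for illustration, and the quantifier over situations is read as universal over the listed situations, which is our assumption.

```python
# Hypothetical finite model: three situations and some invented facts.
SITUATIONS = ["s1", "s2", "s3"]
DANCES = {("jean", "s1"), ("jean", "s2"), ("paul", "s3")}
GOOD = {("jean", "s1"), ("jean", "s2"), ("jean", "s3")}

def D(x, s):
    """x dances in situation s (professionally or not)."""
    return (x, s) in DANCES

def bon(x, s):
    """x is good in situation s."""
    return (x, s) in GOOD

def danseur_modifier(P):
    """λP λx ∀s (D(x,s) → P(x,s)): danseur as a property modifier."""
    return lambda x: all((not D(x, s)) or P(x, s) for s in SITUATIONS)

# 'bon danseur': the adjective is the core, restricted by the bare noun.
bon_danseur = danseur_modifier(bon)
jean_result = bon_danseur("jean")   # good in every situation where he dances
paul_result = bon_danseur("paul")   # dances in s3 but is not good there
```

One consequence of the formula as written is worth noting: an individual who dances in no situation satisfies it vacuously, so a fuller treatment would presumably add a requirement that some dancing situation exists.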

The second advantage of our proposal is that it can be extended to account for
examples of the type shown in (33), which are usually analyzed as lexicalizations or
idioms. Within our account, they can instead be analyzed in terms of property
modification:

(33) a. Jean est (beau / gentil) garçon.

Jean is [a] (beautiful / nice) guy
b. Marie est (vieille fille / jeune grand-mère).

Mary is [a] (old maid / young grandmother)

Finally, we can understand why there are restrictions on the adjectives which
can be used in examples like (34a). The most usual are good and bad, and all their
variants like beautiful, awful... One can find examples with young but they are less
frequent, and it often seems difficult to use old instead of young. Young in (35a-b)
means young in this function and can be paraphrased by depuis peu, 'recently'.

(34) a. Paul est bon élève.

Paul is [a] good student
b. Jean est piètre avocat.

John is [a] very mediocre lawyer 

(35) a. Quand je serai jeune retraité, ...

When I am a young retired person, ...
b. Marie est jeune députée et ne connaît pas encore les usages.

Mary is a young member of Parliament and doesn't yet know the customs

All other adjectives are excluded from these constructions, in particular I-level
adjectives, which denote permanent properties, like rich, parisian...

(36) a. *Paul est (beau / riche) professeur. 

Paul-is(-nice/-rich)-professor 
b. *Paul est professeur parisien. 

Paul-is-parisian-professor 

In fact, the only adjectives that can appear in this type of construction belong to the
class of what Siegel (1976) named non-intersective but subsective adjectives.
According to Siegel, intersective adjectives are of type (e,t) while subsective adjectives
are of type ((e,t), (e,t)) and modify the noun with which they are phrased: a sentence
like John is a good dancer is ambiguous because the adjective good is ambiguous,
and may be intersective or subsective. Larson (1995) has proposed another analysis
for the same type of examples. According to him, the ambiguity doesn't come from
the adjectives but from nouns like dancer, which introduce two variables, one
event variable and one individual variable. The lexical content associated with the
noun dancer corresponds to λxλe. dancer (x,e). Larson (1995) proposed to extend
Davidson's analysis of adverbial modification to adjectival modification. To do that,
he relativizes the semantics of common nouns like dancer to events, he analyzes
adjectives as predicates and he allows adjective phrases to be predicated either of x or
of e. Consequently, a beautiful dancer may be associated with both logical forms given
in (37):

(37) a. λe λx [dancing (x, e) & beautiful(x)] 'beautiful dancer1'

b. λe λx [dancing (x, e) & beautiful(e)] 'beautiful dancer2'

Larson's analysis accounts for the ambiguity of (38), which can be associated with
the two following Logical Forms (38b-c), presented as tripartite structures Quantifier
[Restriction] [Nuclear scope].

(38) a. John is a beautiful dancer

b. ∀e [dancing(e, j)] [beautiful (j)]

c. ∀e [dancing(e, j)] [beautiful(e)]
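
The two readings in (38b-c) can come apart in a model where the dancing events are beautiful but the dancer is not. The sketch below is illustrative only: the events and facts are invented, and we treat the quantifier over events as universal, which is an assumption about how the tripartite structure is to be read.

```python
# Hypothetical model: two dancing events by John; the events are beautiful,
# John himself is not.
EVENTS = ["e1", "e2"]
DANCING = {("e1", "john"), ("e2", "john")}
BEAUTIFUL = {"e1", "e2"}   # things (individuals or events) that are beautiful

def reading_individual(x):
    """(38b): for every event e of x dancing, x is beautiful."""
    return all(x in BEAUTIFUL for e in EVENTS if (e, x) in DANCING)

def reading_event(x):
    """(38c): for every event e of x dancing, e is beautiful."""
    return all(e in BEAUTIFUL for e in EVENTS if (e, x) in DANCING)

individual_reading = reading_individual("john")
event_reading = reading_event("john")
```

In this model the event reading holds while the individual reading fails, which is the ambiguity Larson's analysis is designed to capture.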

The general situation is the following: some adjectives apply strictly to non-events
(for instance aged), others apply strictly to events (for instance former) and still others
apply naturally to both, yielding ambiguity (for instance beautiful). Larson's analysis
is very interesting and is perhaps adequate for English, but it can't be used to account
for French data, because there is a contrast in French between copular sentences with
IS and copular sentences with BS, which has no equivalent in English. English doesn't
have bare nouns in predicate positions 2 .

Our proposal is distinct from both Siegel's and Larson's. In our view,
bon danseur is viewed not as a bare noun modified by an adjective, but as an
adjective modified by a noun of property. Bon danseur means "bon en tant que
danseur", "bon, quand il danse". When one combines a bare noun with an adjective,
one obtains a complex property, built from the adjective and modified by the bare
noun. We consider that the core of the phrase bon danseur is the adjective bon, not
the noun danseur. This is why our analysis is very different from Larson's: what is
relevant in this type of construction is the adjective, which supports various
constraints, not the noun. We don't analyze the noun in (39) as an event noun. It is
the adjective which supports the major predication in (39); the noun only modifies it,
and is used to impose a restriction on the denotation of the subject. The subject is
interpreted as a qua-object, as defined by Fine (1982). More generally, we can say that
when an adjective is modified by a noun of property, the sentence can be paraphrased
by (40). This explains why the noun has to be in a relation with a verb, and the
adjective has to be related to an adverb.

(39) a. Jean est bon danseur. 

John is [a] good dancer 

(40) a. DP, as N, is Adj. 

b. DP, when he V[derived from N], V[derived from N] Adv[derived from Adj]

Our proposal presents two empirical advantages over others: it can be extended to
comparable constructions, including an adjective modified by a noun or a participle
which can't be used bare without the adjective, as in (41). In fact, besides nouns of
profession which can be used as property modifiers, most deverbal nouns can also
appear in this position.

(41) a. Jean est bon / mauvais perdant.

John-is-(good / bad)-loser

a'. * Jean est perdant. (with the meaning of a status)

John-is-loser

b. Jean est-il beau parleur, désespéré ou crétin ?

Is John [a] smooth talker, [a] desperate man or [a] moron?

b'. * Jean est parleur.

John-is-talker


2 Let us note two exceptions: Chairman and President may be used bare in predicate position.

Furthermore, we can add to the list of adjectives which can appear in these
constructions grand, which is related to the adverb grandement, and gros, which
may be contextually recategorized as an adverb, as in (42c-d). Grand and gros are
qualified by Szabo (2001) as evaluative adjectives: this means that they have an 'as' or
qua position which may remain unsaturated (cf (43)).

(42) a. Jean est grand amateur d'art. 

John-is-big-lover-of-art 

b. Jean est gros buveur de bière.

John-is-big-drinker-of-beer

c. Jean joue gros.

John-plays-GROS 'John gambles a lot of money'

d. Ça peut te coûter gros.

CA-may-dative-cost-GROS 'It may cost you a lot' 

(43) a. John is tall John is tall as a person. 

b. Everest is tall Everest is tall as a mountain. 

6 Conclusion 

The main claim of this paper is that interpretative differences between bare nouns and 
indefinite nouns in predicate position in French derive from a difference between two 
types of judgments. We have proposed to analyze copular sentences built with bare 
nouns as predicational sentences, and copular sentences built with indefinite noun 
phrases as identity sentences. Consequently, bare nouns present some similarities with 
adjectives, which denote properties, whereas indefinite noun phrases are viewed as 
individual denoting phrases, just like proper names or definite noun phrases. 

We have also shown how to account for modified bare nouns in this framework. 
Very frequently, when a bare noun co-occurs with an adjective in a copular sentence, 
the head of the postcopular phrase is not the bare noun, but the adjective. 
Nevertheless, in certain cases such as (44), both analyses may be possible: either the
head of the postcopular phrase is the adjective (and the bare noun is an adjective
modifier), or the head of the phrase is the bare noun (and the adjective modifies the
noun). Thus, simple soldat, petit commerçant and danseur professionnel are ambiguous
and may be analyzed either as an adjective phrase or as a noun phrase.

(44) a. Jean est (simple soldat / petit commerçant).

John is [a] (regular soldier / small shopkeeper)
b. Paul est danseur professionnel.

Paul is [a] professional dancer

Finally, some issues concerning bare nouns in French have been left aside. In
particular, nothing was said about restrictions on the class of nouns which can appear
without a determiner in predicate position. However, we hope that the line of thought
proposed here may provide some pointers for further studies of bare nominals and
their special status at the syntax-semantics interface.


Abstract. We propose a formal reconstruction of the well-known
storage-and-retrieval technique for scoping quantifiers and other 'covertly moved'
semantic operators due to Cooper (1975). In the proposed reconstruction,
grammar rules are presented in the familiar term-labelled Gentzen-sequent
style of natural deduction. What is new is that, in addition to the
usual contexts to the left of the turnstile (recording undischarged pairs
of hypotheses, with each pair consisting of a syntactic variable ('trace')
and a corresponding semantic variable), our typing judgments also include
a co-context to the right of the co-turnstile (⊣). A co-context
consists of a list of semantic variables, each paired with a quantifier that
corresponds to the meaning expressed by a quantified noun phrase whose
scope has not yet been specified. Besides the usual logical rules, the grammar
also contains rules called Commitment and Responsibility that
implement, respectively, storage and retrieval of semantic operators.


1 Introduction 


From the mid-1970s until the emergence of Chomsky's Minimalist Program (MP,
1995) in the 1990s, the mainstream of research on natural-language syntax in
much of the world embraced a theoretical architecture for syntactic derivations
that came to be known as the T-model. According to this model, which underlay
Chomsky's (1976, 1977) Extended Standard Theory (EST) of the 1970s and its
successor, the Government-Binding (GB) Theory (Chomsky 1981) of the 1980s
and early 1990s, a tree called a deep structure (DS) is generated from lexical
entries by essentially context-free base rules. The DS is then converted into a
surface structure (SS) by transformations, destructive structural operations
that can delete, copy, or (most significantly for us) move subtrees. From SS, the
derivation branches (the two arms of the T): in one direction the SS is further
transformed into a phonetic form (PF), which determines what the expression
being analyzed sounds like, and in the other direction the SS is transformed into
a logical form (LF), which determines what the expression means.

In the T-model, the transformations that convert DS to SS are called overt,
because their effects are (at least potentially) audible (since the branch of the
derivation that leads to PF is yet to come). The prototypical case of overt movement
is overt wh-movement in languages like English, where constituent questions
are formed (so the theory goes) by moving a wh-expression (or, in so-called
pied-piping constructions, an expression properly containing a wh-expression)
to the left periphery of a clause. Since both PF and LF are derived from SS, this
movement is subsequently reflected in both how the sentence sounds, and what
it means:


(1) Overt Wh-Movement in the T-Model 

a. I wonder who Chris thinks Kim likes. 

b. DS: (I wonder (Chris thinks (Kim likes who))) 

c. SS: (I wonder (who_t (Chris thinks (Kim likes t))))

d. LF: (I wonder (who_x (Chris thinks (Kim likes x))))

Here, the wh-operator who occupies an argument (A) position at DS.
After overt movement, it occupies a nonargument (Ā) position in SS on the
left periphery of one of the clauses that contained it; in this sentence, the only 
clause it can move to is the middle one (with subject Chris), because the verb 
wonder is the kind of verb that requires an interrogative complement clause. 
When who moves, it leaves behind a trace or syntactic variable (here, t), 
which it binds at SS; this is essentially the same position it will occupy at LF. 
Now since derivations branch to PF (and LF) after SS, the movement of who has 
an audible reflex (you hear it in the position it moved to). And finally, during 
the SS-to-LF derivation, a rule of construal replaces t with a logical variable 
(here, x), which is bound by who at LF. 

Now nobody with even a rudimentary knowledge of lambda calculus or predicate
logic could fail to notice that the SS in (1c) and the LF in (1d) look a
lot like formal terms containing operators that bind variables. But, at least as
far as I know, no logician has ever suggested that λ's, or ∃'s, or ∀'s, actually
start out in the position of the variables they bind, and then move to the left.
So one might well ask why transformational grammarians, right down to the
present day, believe that binding operators in NL do. At least 30 years ago,
practitioners of categorial grammar (CG) (e.g. David Dowty, Emmon Bach)
and phrase structure grammar (PSG) (e.g. Gerald Gazdar, Geoff Pullum)
started asking this very question, and in the intervening decades researchers in
these frameworks have proposed a wealth of carefully thought out theories in
which NL binding operators do not move. We will come back to this.

By contrast with overt movement (within the T-model), transformations
that convert SS to LF are called covert because they take place too late in the
derivation, after the SS branch point, to have a reflex at PF. One standardly
assumed covert movement is quantifier raising (QR, May 1977, 1985), which
moves a quantificational NP (QNP) to a position in LF (reflective of its semantic
scope) higher than the one it occupied at SS.


(2) Covert Movement in the T-Model: QR

a. I know Chris thinks Kim likes everyone. 

b. DS: (I know (Chris thinks (Kim likes everyone))) 

c. SS: (I know (Chris thinks (Kim likes everyone))) [no change ] 

d. LF (narrow scope reading): (I know (Chris thinks (everyone_x (Kim likes x))))

e. LF (medium scope reading): (I know (everyone_x (Chris thinks (Kim likes x))))

f. LF (wide scope reading): (everyone_x (I know (Chris thinks (Kim likes x))))


Here, the QNP everyone occupies an argument (A) position at DS, and
nothing happens to it between DS and SS (no overt movement). But after covert
movement, it occupies a nonargument (Ā) position in LF on the left periphery
of one of the clauses that contained it. Now when everyone moves, it leaves
behind a logical variable (here, x), which it binds at LF. But since derivations
branch after SS to PF and LF, and the movement of everyone is on the SS-to-LF
branch, it has no audible reflex (you hear it in its pre-movement position).
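
The three LFs in (2d-f) differ only in which clause the QNP takes scope over, so they can be enumerated mechanically. The following sketch does just that; it is not an implementation of QR or of any grammar formalism, and the function name build_lf, the clause encoding, and the subscript notation are our own illustrative choices.

```python
def build_lf(clauses, qnp, scope_index, var="x"):
    """Return the LF string in which qnp scopes over clauses[scope_index].

    clauses is a list of (subject, verb) pairs from the outermost clause to
    the innermost one; the QNP is assumed to be the object of the innermost.
    """
    lf = var                                  # the variable left in object position
    for i in range(len(clauses) - 1, -1, -1):
        subj, verb = clauses[i]
        lf = f"({subj} {verb} {lf})"          # build the clause around what we have
        if i == scope_index:
            lf = f"({qnp}_{var} {lf})"        # adjoin the QNP at this clause's edge
    return lf

CLAUSES = [("I", "know"), ("Chris", "thinks"), ("Kim", "likes")]

narrow = build_lf(CLAUSES, "everyone", 2)
medium = build_lf(CLAUSES, "everyone", 1)
wide = build_lf(CLAUSES, "everyone", 0)
```

The surface string is the same in all three cases; only the position of the operator in the LF changes, which is what makes the movement covert in the T-model's terms.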

Another standardly assumed covert movement is covert wh-movement
in languages like Chinese (Huang et al. 1992; Pesetsky 1987) 1. Covert
wh-movement is supposed to be essentially the same as overt wh-movement, except
that, since, like QR, it takes place after the SS branch point, it is heard just
as if it had never moved (or, to use the syntactician's term of art, it remains
in situ).


(3) Covert Wh-Movement in the T-Model: Wh-in-Situ

a. Zhangsan xiang-zhidao shei mai-le shenme. [Chinese] 

b. Zhangsan wonder who bought what [English word-for-word gloss] 

1 But see Aoun and Li (1993) for a dissenting view (that Chinese wh-movement is
overt movement of an inaudible operator, with the wh-expressions as bindees, not
binders.)

c. DS: (Zhangsan xiang-zhidao (shei mai-le shenme))

d. SS: (Zhangsan xiang-zhidao (shei mai-le shenme)) [no change] 

e. LF (shei and shenme both narrow):

(Zhangsan xiang-zhidao (shei_x shenme_y (x mai-le y)))
'Zhangsan wonders who bought what'

f. LF (shenme narrow, shei wide):

(shei_x (Zhangsan xiang-zhidao (shenme_y (x mai-le y))))

'Who does Zhangsan wonder what (s/he) bought?'

g. LF (shenme wide, shei narrow):

(shenme_y (Zhangsan xiang-zhidao (shei_x (x mai-le y))))

'What does Zhangsan wonder who bought?'


Here, as with QR, there is no change between DS and SS. Each of the wh- (or,
in Chinese, sh-) operators can scope to any of the clauses containing it. However,
in this example, at least one of them must scope to the lower clause, since the
clausal complement of the verb xiang-zhidao 'wonder' has to be a question.

In fact, even languages like English with overt wh-movement also have in situ
wh, in two different respects. First, in multiple constituent questions, all
but the leftmost wh-expression remain in situ. And second, in cases of pied piping,
the wh-expression that is properly contained within the moved constituent
remains in situ, relative to the displaced constituent that contains it. In this
paper, however, we will limit our attention henceforth to phenomena that
transformational grammar (TG) has analyzed purely in terms of covert movements.

In the rest of the paper, we sketch an approach to so-called covert phenomena
in which (as in logic) binding operators never move. (For the extension of this
approach to so-called overt movement phenomena, see Pollard 2008b.)


2 Toward a New, Nontransformational Synthesis 


The T-model has long since been abandoned. Within the Chomskyan syntactic
tradition, the Minimalist Program (MP, Chomsky 1995) provides much more
flexibility than EST or GB did, by discarding the notions of DS and SS. Instead,
merges (corresponding to EST/GB base rules) need not all take place before any
moves do. And the possibility of multiple branch points in a single derivation
('Spell-outs') means that not all overt moves must occur 'lower' in the derivation
than any of the covert ones. These are not exactly negative developments; but it
is well worth noting that, had transformational grammarians followed the lead
of CG and PSG practitioners from the 1970s on in informing their theory by
ideas from logic (as opposed to logic metaphors), the architectural problems of
EST/GB that the MP has sought to repair could have been addressed much
earlier, or even avoided altogether. Here are a few examples.

First, in EST/GB, as noted above, LF is derived from SS. But an LF looks
a lot like a semantic lambda-term, and so, in light of the Curry-Howard (types
as formulas, terms as proofs) conception (Curry et al. 1958; Howard 1980), we
should be able to think of it as an (intuitionistic) proof in its own right. So there
is no reason why it has to be derived from SS (or anything else).

Second, also as noted above, an EST/GB labelled bracketing, which typically
contains traces (syntactic variables) and inaudible operators which bind them,
also looks a lot like a lambda term. But by then (1970s to early 1980s), Lambek
(1958) had already long since proposed that NL syntax be formulated
in terms of a substructural proof theory. Moreover the idea of extending the
Curry-Howard conception to substructural logics was continually being rediscovered 2 ;
so, in hindsight at least, it is easy to perceive these labelled bracketings
as Curry-Howard terms for some resource-sensitive logic or other. But in that
case, linguists should think of NL syntactic trees as proof trees, as Moortgat
(1991) and other categorial grammarians had already realized in the mid-to-late
1980s, not as structures whose subtrees can be deleted, copied, or moved by
transformations (and whose internal structural configurations could be relevant 
in the formulation of linguistically significant generalizations). 

Third (given the preceding), there is no need to stipulate a Strict Cycle Condition
(Chomsky 1976) on rule application (roughly, that once a rule has applied
to a given tree, it is already too late for any rule to apply solely to one of that
tree's proper subtrees), for the simple reason that a proof cannot go back and
change earlier parts of itself!

And fourth, also in hindsight, it is clear that the notion of DS is not only
unnecessary but pernicious. That is because DS is the stage of the derivation
at which all base rule applications (merges) have taken place but none of the
transformational rule applications (moves). In proof theoretic terms, what DS
amounts to is a point in a proof subsequent to which only instances of Hypothetical
Proof (but not Modus Ponens) are admitted! But there is no requirement
on proofs that all instances of Modus Ponens appear lower in the proof tree than 
all instances of Hypothetical Proof, just as there is no well-formedness condition 
on lambda terms that all the abstractions occur on the left periphery of the 
term. 

If these observations are on the right track, then the syntax and semantics 
of NL expressions are both proofs in their own right. But then, a grammar
should not be in the business of transforming syntax into semantics; rather, it
should be specifying which syntax-semantics pairs of proofs³ go together. To
put it another way, the syntax-semantics interface should be at once purely
derivational and parallel. Here, by purely derivational, we mean simply that
derivations are proofs, as opposed to nondeterministic algorithms that build
arboreal structures via successive destructive modification. And by parallel, we
mean that there are separate proof theories that provide, respectively, candidate
syntactic and semantic proofs; whereas it is the job of the syntax-semantics 
interface to recursively define the set of proof pairs that belong to the language 
in question. 


2 E.g. (Mints 1981; van Benthem 1983; Buszkowski 1987; Jay 1989; Benton et al. 1992;
Wansing 1992; Gabbay and de Queiroz 1992; Mackie et al. 1993).

3 Or triples, if phonology is also taken into account. 

The pure derivationality of the proposed approach comes straight out of CG,
and the syntactic proof theory we will adopt below will be readily taken for
what it is, a variant of (multimodal) applicative categorial grammar. However,
the mainstream of CG⁴ has eschewed parallelism, in favor of the functional
approach to semantic interpretation bequeathed by Montague, which mandates
that there can never be a purely semantic ambiguity. Rather, on the functional
approach, there must be a function from syntactic proofs/terms⁵ to semantic
proofs/terms; or, to put it another way, all meaning differences must be
disambiguated in the syntax.⁶

But I am not aware of any scientific basis for requiring that the relation 
between syntactic derivations and semantic ones be a function. Indeed, there is 
a long tradition⁷ which rejects the premise that the syntax-semantics relation
is a function from the former to the latter. I will refer to this tradition as the
parallel approach to the syntax-semantics interface. The framework I will be
using below, called Convergent Grammar (CVG), while purely derivational,
also lies squarely within this parallel tradition.⁸

In fact, the idea of a purely derivational parallel grammar architecture has 
already been proposed independently and considerably earlier in this decade by
Lecomte and Retoré (Lecomte and Retoré 2002; Lecomte 2005), and there are
numerous points of similarity between their approach and CVG. However,
unlike their approach, which is part of a larger intellectual enterprise (categorial
minimalism) which seeks to bring about a marriage of CG and MP, the
intellectual tradition to which CVG belongs is one that parted company with (to
use Culicover and Jackendoff’s term) mainstream generative grammar (MGG)
more than three decades ago. As will be made clear shortly, CVG is really a
proof-theoretic embodiment not of minimalism but rather of the storage and
retrieval technology proposed by Cooper (1975, 1983) as an alternative to the
then-current EST/GB. 


4 E.g. (van Benthem 1983; Lambek 1988; Morrill 1994; Steedman 1996; Moortgat 1996;
Carpenter 1997; Jacobson 1999; de Groote 2001b; Ranta 2004; Muskens 2003;
Pollard 2004; Anoun and Lecomte 2007; Bernardi and Moortgat 2007).

5 Or, in the case of Montague, analysis trees. 

6 There is a trivial respect in which any relational syntax-semantics interface can be
rendered functional by allowing sets of usual meanings to serve as ‘meanings’, since
there is a canonical correspondence between binary relations and functions from the
domain of the relation to the powerset of the codomain (the category of relations is
the Kleisli category of the powerset monad on sets). But linguists generally require
of meanings, however they are modelled, that they provide a deterministic
interpretation for contextualized utterances. Thus, we rule out as meanings nondeterministic
(or underspecified) representations (as in the MRS (minimal recursion semantics)
employed in some versions of head-driven phrase structure grammar (HPSG)) that
have to be postprocessed to resolve scopal ambiguities.
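
The correspondence between binary relations and powerset-valued functions noted in this footnote can be made concrete. The following is only a minimal sketch: the function names and the toy data are illustrative assumptions and are not part of CVG.

```python
from collections import defaultdict

def relation_to_function(pairs, domain):
    """Turn a binary relation (a set of pairs) into a function from the
    domain to sets of related elements (a powerset-valued function)."""
    images = defaultdict(set)
    for x, y in pairs:
        images[x].add(y)
    # Domain elements related to nothing are mapped to the empty set.
    return {x: frozenset(images[x]) for x in domain}

def function_to_relation(fn):
    """Inverse direction: recover the set of pairs from the set-valued function."""
    return {(x, y) for x, ys in fn.items() for y in ys}

# Hypothetical toy interface: one syntactic analysis paired with two readings.
syntax_semantics = {
    ("syn1", "every>some"),
    ("syn1", "some>every"),
    ("syn2", "kim-walks"),
}
as_function = relation_to_function(syntax_semantics, {"syn1", "syn2", "syn3"})
```

Under these assumptions the relational interface becomes "functional" only in the trivial sense described above: `as_function["syn1"]` is a set of two readings rather than a single deterministic meaning.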

7 See, e.g. (Cooper 1975, 1983; Bach and Partee 1980; Hendriks 1993; Pollard and
Sag 1994; Lecomte and Retoré 2002; Lecomte 2005; Culicover and Jackendoff 2005).

8 This is scarcely surprising, since it originated as an effort to reformulate HPSG along
type-theoretic lines (Pollard 2004).

The kinds of ambiguities associated with the so-called covert-movement
phenomena, illustrated above in © and ©, bear directly on the functional vs.
parallel issue. Indeed, on parallel approaches, they readily lend themselves to analyses
that locate the ambiguities wholly in the semantics, rather than complicating
the syntax for the mere sake of preserving the (putative) functionality of the
syntax-semantics interface at any cost. To put it simply, it is entirely
permissible, at a certain point in a pair of simultaneous derivations (one syntactic; one
semantic), to do something on the semantic side while doing nothing at all on 
the syntactic side. And as we will see shortly, the Cooper-inspired storage and 
retrieval rules in terms of which we analyze covert movement are of this precise 
character. 


3 Syntax, Semantics, and Their Interface 


For present purposes, we take a CVG to consist of three things: (1) a syntax, 
(2) a semantics, and (3) a syntax-semantics interface (hereafter, interface 
simpliciter).⁹ For the fragment developed here, we can take the syntax to be a
proof theory for a simple multimodal applicative categorial grammar.¹⁰ The semantics
will be another proof theory closely related to the familiar typed lambda calculus
(TLC). And the interface will recursively define a set of pairs of proofs. The
two proof theories are both presented in the Gentzen-sequent style of natural
deduction with Curry-Howard proof terms (see e.g. Mitchell and Scott (1989)),
because this style of proof theory is visually easy to relate to EST/GB-style
or HPSG-style linguistic analyses. 


3.1 Semantics 

Rather than the familiar TLC, we employ a new semantic calculus RC (the
calculus of Responsibility and Commitment)¹¹ which, we argue, is better
adapted to expressing the semantic compositionality of natural language. (But
we will also provide a simple algorithm for transforming RC semantic terms into
TLC, more specifically, into Ty2.) Here we present only the fragment of RC
needed to analyze covert movement; the full calculus, with the two additional
schemata needed to analyze overt movement, is presented in (Pollard 2008b).

Like TLC, RC has types, terms, and typing judgments. One important
difference is that in TLC, the variable context of a typing judgment is just a


9 For the simple fragment developed here, it is easy to read the word order off of the 
syntactic analyses (proof terms). But to do serious linguistics, we also will require a 
phonology and a syntax-phonology interface. Thus CVG is syntactocentric, 
in the sense that syntax has interfaces to phonology and semantics, but only weakly 
so, in the sense that the relations defined by the two interfaces need not be functions. 

10 But in order to extend the theory to cover so-called overt movement phenomena, we
will need to add some form of hypothetical reasoning to the syntactic proof theory
(Pollard 2008b).

11 See (Pollard 2008a) and references cited there for background and discussion.













C. Pollard 


set of variable/type pairs, written to the left of the turnstile. But an RC typing
judgment has a Cooper store, written to the right and demarcated by a
co-turnstile $\dashv$:


(4) Format for RC Typing Judgments

$\vdash a : A \dashv \Delta$


The Cooper store is also called the variable co-context¹²; the ‘co-’ here is
mnemonic not only for ‘Cooper’, but also for ‘Commitment’ (for reasons to
be explained presently), for ‘Covert Movement’, and for ‘Continuation’ (since
the operators stored in them will scope over their own continuations). Thus a
judgment like (4) is read ‘the (semantic) term $a$ is assigned the (semantic) type
$A$ in the co-context $\Delta$.’


(5) RC Semantic Types 

a. There are some basic semantic types. 

b. If $A$ and $B$ are types, then $A \to B$ is a functional semantic type with
argument type $A$ and result type $B$.

c. If $A$, $B$, and $C$ are types, then $O[A, B, C]$, usually abbreviated (following
Shan 2004) to $A_B^C$, is an operator semantic type with binding type
$A$, scope type $B$, and result type $C$.¹³


(6) Basic Semantic Types

For present purposes, we use three basic semantic types:
$\iota$ (individual concepts), $\pi$ (propositions), and $\kappa$ (polar questions).¹⁴


(7) Functional Semantic Types 

We employ the following abbreviations for (necessarily curried) functional
types:

a. Where $\sigma$ ranges over strings of types and $\epsilon$ is the null string:

i. $A_\epsilon =_{def} A$

ii. $A_{B\sigma} =_{def} B \to A_\sigma$ (e.g. $\pi_{\iota\iota} = \iota \to \iota \to \pi$)

b. For $n \in \omega$, $\kappa_n =_{def} \kappa_\sigma$, where $\sigma$ is the string of $\iota$’s of length $n$.

These are the types for $n$-ary constituent questions where the constituents questioned all
have type $\iota$. E.g. who likes what will get type $\kappa_2$.
(8) Operator Types 

a. These will be the semantic types for expressions which would be
analyzed in TG as undergoing Ā-movement (either overt or covert).

12 The full RC calculus, including the schemata for analyzing overt movement, also 
employs ordinary variable contexts to the left of the turnstile. 

13 That is, a term $a$ of type $O[A, B, C]$ binds a variable $x$ of type $A$ in a term $b$ of type
$B$, resulting in a term $(a_x\ b)$ of type $C$.

14 Here $\kappa$ is mnemonic for ‘Karttunen’ because its transform (see below) into Ty2 will
be the Karttunen type for questions.
b. The $O$-constructor is like Moortgat’s (1996) $q$-constructor, but it
belongs to the semantic logic, not the syntactic one.

c. Thus, for example, while for Moortgat (1996) a QNP would have
category $q[\mathrm{NP}, \mathrm{S}, \mathrm{S}]$ and semantic type $(\iota \to \pi) \to \pi$, for us it has
category (simply) NP and semantic type $\iota_\pi^\pi$.¹⁵


(9) RC Semantic Terms 

a. There is a denumerable infinity of semantic variables of each type. 

b. There are finitely many basic semantic constants of each type. 

c. There are functional semantic terms of the form $(f\ a)$, where $f$ and $a$
are semantic terms.

d. There are binding semantic terms of the form $(a_x\ b)$ where $a$ and $b$ are
semantic terms and $x$ is a semantic variable.

e. But there is no $\lambda$!

(10) Cooper Stores 

a. The Cooper stores (co-contexts) will contain semantic operators to be 
scoped, each paired with the variable that it will eventually bind. 

b. We call such stored pairs commitments, and write them in the form
$a_x$, where the type of $x$ is the binding type of $a$.

c. Then we call $x$ a committed variable, and say that $a$ is committed
to bind $x$.


Then the rule schemata of RC are the following: 

(11) Semantic Schema A (Nonlogical Axioms)

$\vdash c : A \dashv$ ($c$ a basic semantic constant of type $A$)

The basic constants notate meanings of syntactic words (see (26)).

(12) Semantic Schema M (Modus Ponens)

If $\vdash f : A \to B \dashv \Delta$ and $\vdash a : A \dashv \Delta'$, then $\vdash (f\ a) : B \dashv \Delta; \Delta'$

a. This is the usual natural-deduction (ND) Modus Ponens, except that
co-contexts have to be propagated from premisses to conclusions.

b. Semicolons in co-contexts represent set union (necessarily disjoint, since
variables are always posited fresh).

(13) Semantic Schema C (Commitment)

If $\vdash a : A_B^C \dashv \Delta$ then $\vdash x : A \dashv a_x : A_B^C; \Delta$ ($x$ fresh)

a. This is a straightforward ND formulation of Cooper storage.


15 Actually QNPs have to be polymorphically typed. See (Pollard 2008a, fn. 4).

b. It generalizes Carpenter’s (1997) Introduction rule for Moortgat’s
(1991) $\Uparrow$ (essentially the special case of $q$ where the scope type and
the result type are the same), but in the semantics, not in the syntax.

(14) Semantic Schema R (Responsibility)

If $\vdash b : B \dashv a_x : A_B^C; \Delta$ then $\vdash (a_x\ b) : C \dashv \Delta$ ($x$ free in $b$ but not in $\Delta$)

a. This is a straightforward ND formulation of Cooper retrieval.

b. It generalizes Carpenter’s (1997) Elimination rule for Moortgat’s $\Uparrow$,
but, again, in the semantics, not in the syntax.

c. It is called Responsibility because it is about fulfilling commitments. 
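
To make the four schemata concrete, here is a minimal sketch of them as operations on judgments. It rests on several assumptions of ours rather than on anything in the calculus itself: a judgment is encoded as a triple (term, type, co-context), types are nested tuples, fresh variables are named `x0`, `x1`, and so on, and the side condition of Schema R that the variable be free in the scope term is not checked.

```python
from itertools import count

_fresh = count()  # supplies indices for fresh variable names

def fun(a, b):
    """Functional type A -> B."""
    return ("->", a, b)

def op(a, b, c):
    """Operator type O[A, B, C]: binding type, scope type, result type."""
    return ("O", a, b, c)

def axiom(const, typ):
    """Schema A: a basic constant with an empty co-context."""
    return (const, typ, frozenset())

def modus_ponens(f, a):
    """Schema M: apply f to a; the co-contexts are unioned."""
    f_term, f_type, f_store = f
    a_term, a_type, a_store = a
    assert f_type[0] == "->" and f_type[1] == a_type, "argument type mismatch"
    return (f"({f_term} {a_term})", f_type[2], f_store | a_store)

def commit(a):
    """Schema C: store the operator and leave a fresh variable in its place."""
    a_term, a_type, store = a
    assert a_type[0] == "O", "only operators can be committed"
    x = f"x{next(_fresh)}"
    return (x, a_type[1], store | {(a_term, x, a_type)})

def retrieve(b, commitment):
    """Schema R: discharge a commitment whose scope type is the type of b."""
    b_term, b_type, store = b
    a_term, x, (_, _, scope, result) = commitment
    assert commitment in store and scope == b_type, "cannot retrieve here"
    return (f"({a_term}_{x} {b_term})", result, store - {commitment})

# Usage: a semantic derivation for 'Kim likes everyone'.
QNP = op("iota", "pi", "pi")
like = axiom("like'", fun("iota", fun("iota", "pi")))
kim = axiom("Kim'", "iota")
x = commit(axiom("everyone'", QNP))
core = modus_ponens(modus_ponens(like, x), kim)
(pending,) = core[2]              # the single stored commitment
scoped = retrieve(core, pending)
print(scoped[0])                  # (everyone'_x0 ((like' x0) Kim'))
```

The co-context is a `frozenset` because co-contexts are sets rather than lists, so nothing in the encoding fixes the order in which several stored operators are retrieved.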


To give the reader a familiar point of reference, we provide a transform
of RC into the standard higher-order semantic representation language Ty2
(Gallin 1975).¹⁶ We follow Carpenter (1997, Sect. 11.2) in using individual
concepts as the basic type for NPs. But we use the Gallin/Montague names
for the basic types $e$ (entities, Carpenter’s individuals), $t$ (truth values,
Carpenter’s booleans), and $s$ (worlds), rather than Carpenter’s Ind, Bool, and World
respectively. Hence our (and Montague’s) type $s \to e$ for individual concepts
corresponds to Carpenter’s type World $\to$ Ind.¹⁷

We also follow Carpenter’s convention that functional meaning types take
their world argument last rather than first, e.g. the type for an intransitive verb
is $(s \to e) \to s \to t$ (the transform of RC type $\iota \to \pi$) rather than
$s \to (s \to e) \to t$, so that the verb meaning combines with the subject meaning by ordinary
function application.

The price, well worth paying, is that, except for individual concepts and
propositions, our Ty2 meanings are technically not intensions (functions from worlds).
Consequently the extension at a world $w$ of a Ty2 meaning is defined by recursion
on types as follows:


(15) Ty2 Meaning Types

a. $s \to e$ (individual concepts) is a Ty2 meaning type.

b. $s \to t$ (propositions) is a Ty2 meaning type.

c. If $A$ and $B$ are Ty2 meaning types, then so is $A \to B$.


(16) Extensional Types Corresponding to Ty2 Meaning Types 

These are defined as follows: 

16 This transform is not a proper part of our framework, but is provided in order to 
show that familiar meaning representations can be algorithmically recovered from 
the ones we employ. Readers who are not concerned with this issue can just ignore 
this transform. 

17 Types for Ty2 variables are as follows: $x, y, z : s \to e$ (individual concepts);
$p, q : s \to t$ (propositions); $w : s$ (worlds); and $P, Q : (s \to e) \to s \to t$ (properties of individual
concepts).

a. $E(s \to e) = e$

b. $E(s \to t) = t$

c. $E(A \to B) = A \to E(B)$

(17) Extensions of Ty2 Meanings

The relationship between Ty2 meanings and their extensions is
axiomatized as follows, where the family of constants $\mathrm{ext}_A : s \to A \to E(A)$ is
parametrized by the Ty2 meaning types:¹⁸

a. $\vdash \forall x\, \forall w\, (\mathrm{ext}_w(x) = x(w))$ (for $x : s \to e$)

b. $\vdash \forall p\, \forall w\, (\mathrm{ext}_w(p) = p(w))$ (for $p : s \to t$)

c. $\vdash \forall f\, \forall w\, (\mathrm{ext}_w(f) = \lambda_x\, \mathrm{ext}_w(f(x)))$ (for $f : A \to B$, $A$ and $B$ Ty2 meaning
types)

(18) The Transform $\tau$ from RC Types to Ty2 Meaning Types

a. $\tau(\iota) = s \to e$

b. $\tau(\pi) = s \to t$

c. $\tau(\kappa) = \tau(\pi) \to \tau(\pi)$

d. $\tau(A \to B) = \tau(A) \to \tau(B)$

e. $\tau(A_B^C) = (\tau(A) \to \tau(B)) \to \tau(C)$
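
The type transform in (18), together with the extensional types in (16), is a straightforward recursion. The sketch below is one possible encoding and not part of the framework: RC types are the strings `"iota"`, `"pi"`, `"kappa"` or tagged tuples, and the names `tau`, `extension_type`, and `show` are our own.

```python
S_E = ("->", "s", "e")   # s -> e, individual concepts
S_T = ("->", "s", "t")   # s -> t, propositions

def tau(rc_type):
    """Transform an RC type into a Ty2 meaning type, following (18)."""
    if rc_type == "iota":
        return S_E
    if rc_type == "pi":
        return S_T
    if rc_type == "kappa":
        return ("->", S_T, S_T)
    tag = rc_type[0]
    if tag == "->":                      # functional type A -> B
        _, a, b = rc_type
        return ("->", tau(a), tau(b))
    if tag == "O":                       # operator type O[A, B, C]
        _, a, b, c = rc_type
        return ("->", ("->", tau(a), tau(b)), tau(c))
    raise ValueError(f"not an RC type: {rc_type!r}")

def extension_type(meaning_type):
    """Extensional type of a Ty2 meaning type, following (16)."""
    if meaning_type == S_E:
        return "e"
    if meaning_type == S_T:
        return "t"
    _, a, b = meaning_type
    return ("->", a, extension_type(b))

def show(t):
    """Pretty-print a type with right-associative arrows."""
    if isinstance(t, str):
        return t
    _, a, b = t
    left = show(a) if isinstance(a, str) else f"({show(a)})"
    return f"{left} -> {show(b)}"

print(show(tau(("->", "iota", "pi"))))          # (s -> e) -> s -> t
print(show(tau(("O", "iota", "pi", "pi"))))     # ((s -> e) -> s -> t) -> s -> t
print(show(extension_type(tau("kappa"))))       # (s -> t) -> t
```

The first line reproduces the intransitive-verb type given earlier, and the second is the familiar generalized-quantifier type obtained from the QNP type.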

(19) The Transform $\tau$ on Terms

a. Variables and basic constants are unchanged except for their types. (We
make abundant use of meaning postulates, e.g. (20), rather than giving
basic constants nonbasic transforms.)

b. $\tau((f\ a)) = \tau(f)(\tau(a))$

The change in the parenthesization has no theoretical significance. It
just enables one to tell at a glance whether the term belongs to RC or
to Ty2, e.g. $(\mathrm{walk}'\ \mathrm{Kim}')$ vs. $\mathrm{walk}'(\mathrm{Kim}')$.

c. $\tau((a_x\ b)) = \tau(a)(\lambda_x\, \tau(b))$

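(20) Ty2 Meaning Postulates for Generalized Quantifiers

$\vdash \mathrm{every}' = \lambda_Q \lambda_P \lambda_w\, \forall x\, (Q(x)(w) \to P(x)(w))$
$\vdash \mathrm{some}' = \lambda_Q \lambda_P \lambda_w\, \exists x\, (Q(x)(w) \wedge P(x)(w))$

$\vdash \mathrm{everyone}' = \mathrm{every}'(\mathrm{person}')$
$\vdash \mathrm{someone}' = \mathrm{some}'(\mathrm{person}')$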

18 We omit the type subscript $A$ on $\mathrm{ext}_A$ when it is inferrable from context. Moreover
we abbreviate $\mathrm{ext}(w)$ as $\mathrm{ext}_w$.

3.2 Syntax

For the fragment developed here, our syntactic calculus is just a simple
multimodal applicative CG.¹⁹ Again, there are types, now called (syntactic)
categories, terms, and typing judgments, which have the form

(21) Format for CVG Syntactic Typing Judgments

$\vdash a : A$

read ‘the (syntactic) term $a$ is assigned the category $A$.’

(22) CVG Categories 

a. There are some basic categories. 

b. If $A$ and $B$ are categories, so are $A \multimap_F B$, where $F$ belongs to a set
$\mathbf{F}$ of grammatical function names²⁰; these are called functional
categories with argument category $A$ and result category $B$.

(23) Basic Categories 

For now, just S and NP. 

(24) Functional Categories 

We start off with the grammatical function names S (subject) and C
(complement).²¹ Others will be added as needed.

(25) CVG Syntactic Terms 

a. There are finitely many (syntactic) words of each category. 

b. There are syntactic functional terms of the forms $(f\ a^F)$ and $({}^F a\ f)$.

(26) (Syntactic) Words 

a. These correspond not just to Bloomfield’s “minimal free forms”, but
also to minimal syntactic units realized phonologically as phrasal affixes,
sentence particles, argument clitics, etc.

b. Some of these might be realized nonconcatenatively, e.g. by pitch
accents, (partial) reduplication, phonological zero (inaudibility), etc.


19 But to analyze overt movement, it will have to be extended with schemata for traces
and syntactic binding by ‘overtly moved’ syntactic operators (Pollard 2008b).

20 Thus grammatical functions are abstract tectogrammatical primitives, and not
defined in terms of word order, phonology, or the positions in which they occur in
proof trees. And so the role grammatical functions play in CVG is strongly
analogous to the role that they play in such frameworks as HPSG, lexical-functional
grammar (LFG), and relational grammar (RG). Multiple modes of implication can
be replaced by a single linear implication (see (de Groote et al. 2009) for details), at
the expense of considerably elaborating the set of basic types.

21 Here CVG betrays its HPSG pedigree. 

(27) Syntactic Functional Terms

a. In principle these could always be written $(f\ a^F)$, but we write $(f\ a^C)$
and $({}^S a\ f)$ as a mnemonic that in English subjects are to the left and
complements to the right.

b. This enables us to read the word order off the syntactic terms, as in
EST/GB labelled bracketings.

The CVG syntactic rule schemata are as follows:

(28) Syntactic Schema W (Words)

$\vdash w : A$ ($w$ a syntactic word of category $A$)

(29) Syntactic Schema M$_S$ (Subject Modus Ponens)

If $\vdash a : A$ and $\vdash f : A \multimap_S B$, then $\vdash ({}^S a\ f) : B$

(30) Syntactic Schema M$_C$ (Complement Modus Ponens)

If $\vdash f : A \multimap_C B$ and $\vdash a : A$, then $\vdash (f\ a^C) : B$

3.3 The CVG Syntax-Semantics Interface 

The interface recursively specifies which syntactic proofs are paired with which
semantic ones. Unsurprisingly, the recursion is grounded in the lexicon:

(31) Interface Schema L (Lexicon)

$\vdash w, c : A, B \dashv$ (for certain pairs $\langle w, c \rangle$ where $w$ is a word of category $A$
and $c$ is a basic constant of type $B$)

The following two schemata are essentially ND reformulations of HPSG’s
Subject-Head and Head-Complement schemata:

(32) Interface Schema M$_S$ (Subject Modus Ponens)

If $\vdash a, c : A, C \dashv \Delta$ and $\vdash f, v : A \multimap_S B, C \to D \dashv \Delta'$
then $\vdash ({}^S a\ f), (v\ c) : B, D \dashv \Delta; \Delta'$

(33) Interface Schema M$_C$ (Complement Modus Ponens)

If $\vdash f, v : A \multimap_C B, C \to D \dashv \Delta$ and $\vdash a, c : A, C \dashv \Delta'$
then $\vdash (f\ a^C), (v\ c) : B, D \dashv \Delta; \Delta'$

And finally, the following two rules, both of which leave the syntax unchanged,
are ND reformulations of Cooper storage and retrieval, respectively.

(34) Interface Schema C (Commitment)

If $\vdash a, b : A, B_C^D \dashv \Delta$, then $\vdash a, x : A, B \dashv b_x : B_C^D; \Delta$ ($x$ fresh)

(35) Interface Schema R (Responsibility)

If $\vdash e, c : E, C \dashv b_x : B_C^D; \Delta$ then $\vdash e, (b_x\ c) : E, D \dashv \Delta$
($x$ free in $c$ but not in $\Delta$)

It should be noted that, since co-contexts are sets, not lists, retrieval is
nondeterministic not only with respect to which node in the proof tree it takes place
at, but also with respect to which of the stored operators is retrieved.

4 Analysis of Quantifier Raising in English

Our English fragment will employ the following lexicon. By convention, for any 
lexical entry, the words and the semantic constants are presupposed to have 
already been licensed, respectively, by the syntactic and semantic logics. 

(36) Lexicon for English Fragment 

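$\vdash \text{Chris}, \mathrm{Chris}' : \mathrm{NP}, \iota \dashv$ (likewise other names)
$\vdash \text{everyone}, \mathrm{everyone}' : \mathrm{NP}, \iota_\pi^\pi \dashv$
$\vdash \text{someone}, \mathrm{someone}' : \mathrm{NP}, \iota_\pi^\pi \dashv$
$\vdash \text{likes}, \mathrm{like}' : \mathrm{NP} \multimap_C \mathrm{NP} \multimap_S \mathrm{S}, \iota \to \iota \to \pi \dashv$
$\vdash \text{thinks}, \mathrm{think}' : \mathrm{S} \multimap_C \mathrm{NP} \multimap_S \mathrm{S}, \pi \to \iota \to \pi \dashv$

(37) A Simple Sentence

a. Chris thinks Kim likes Dana.

b. $\vdash ({}^S \text{Chris}\ (\text{thinks}\ ({}^S \text{Kim}\ (\text{likes}\ \text{Dana}^C))^C)),$
$((\mathrm{think}'\ ((\mathrm{like}'\ \mathrm{Dana}')\ \mathrm{Kim}'))\ \mathrm{Chris}') : \mathrm{S}, \pi \dashv$

c. Ty2: $\mathrm{think}'(\mathrm{like}'(\mathrm{Dana}')(\mathrm{Kim}'))(\mathrm{Chris}')$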

(38) Quantifier Scope Ambiguity 

a. Chris thinks Kim likes everyone. 

b. Syntax (both): 

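$({}^S \text{Chris}\ (\text{thinks}\ ({}^S \text{Kim}\ (\text{likes}\ \text{everyone}^C))^C)) : \mathrm{S}$

c. Semantics (scoped to lower clause):

RC: $((\mathrm{think}'\ (\mathrm{everyone}'_x\ ((\mathrm{like}'\ x)\ \mathrm{Kim}')))\ \mathrm{Chris}') : \pi$
Ty2: $\mathrm{think}'(\lambda_w (\forall x\, (\mathrm{person}'(x)(w) \to \mathrm{like}'(x)(\mathrm{Kim}')(w))))(\mathrm{Chris}') : s \to t$

d. Semantics (scoped to upper clause):

RC: $(\mathrm{everyone}'_x\ ((\mathrm{think}'\ ((\mathrm{like}'\ x)\ \mathrm{Kim}'))\ \mathrm{Chris}')) : \pi$

Ty2: $\lambda_w (\forall x\, (\mathrm{person}'(x)(w) \to \mathrm{think}'(\mathrm{like}'(x)(\mathrm{Kim}'))(\mathrm{Chris}')(w))) : s \to t$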

(39) Raising of Two Quantifiers to Same Clause 

a. Everyone likes someone. 

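b. Syntax (both): $({}^S \text{everyone}\ (\text{likes}\ \text{someone}^C)) : \mathrm{S}$

c. $\forall\exists$-reading (RC): $(\mathrm{everyone}'_x\ (\mathrm{someone}'_y\ ((\mathrm{like}'\ y)\ x))) : \pi$
d. $\exists\forall$-reading (RC): $(\mathrm{someone}'_y\ (\mathrm{everyone}'_x\ ((\mathrm{like}'\ y)\ x))) : \pi$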

e. These are possible because for generalized quantifiers, the result type is 
the same as the scope type. 

f. Things are not so straightforward in the case of multiple in-situ wh- 
operators, as we will see in the next section. 
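
The nondeterminism of retrieval behind the two readings in (39) can be illustrated by enumerating the orders in which stored operators are retrieved at a single node. This is only a sketch under simplifying assumptions: terms are plain strings, commitments are (operator, variable) pairs, and every stored operator is assumed to have the same scope type and result type, so that any order is type-correct, as it is for generalized quantifiers.

```python
from itertools import permutations

def retrieve_all(core, store):
    """Yield every RC term obtained by retrieving all stored operators
    at the same node, in every possible order."""
    for order in permutations(sorted(store)):
        term = core
        # The operator retrieved first ends up with the narrowest scope.
        for operator, variable in order:
            term = f"({operator}_{variable} {term})"
        yield term

store = {("everyone'", "x"), ("someone'", "y")}
readings = sorted(retrieve_all("((like' y) x)", store))
for reading in readings:
    print(reading)
# (everyone'_x (someone'_y ((like' y) x)))
# (someone'_y (everyone'_x ((like' y) x)))
```

With n operators in the store this yields n! terms, which is why the corresponding enumeration fails for in-situ wh-operators, whose scope and result types differ.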

5 Background for the Analysis of Wh-in-Situ


In dealing with the semantics of (possibly multiple) in-situ constituent questions,
we take as our target (Ty2) semantics a variant (Pollard 2008c) of Karttunen’s
(1977) semantics of interrogatives, which analyzes interrogative denotations as
sets of propositions. We follow Karttunen in the case of polar questions; but for
n-place constituent questions, we take the denotation to be (the curried form
of) a function from n-tuples to propositions:²²


(40) Types for Polar Questions 

a. RC meaning type: $\kappa$

b. Meaning type of Ty2 transform: $(s \to t) \to s \to t$ (property of
propositions)

c. Type of Ty2 extension: $(s \to t) \to t$ ((characteristic function of) a
(singleton) set of propositions)

d. Example: at $w$, Does Chris walk (or whether Chris walks) denotes the
singleton set whose member is whichever is true at $w$, the proposition
that Chris walks or the proposition that s/he doesn’t.

(41) Types for Unary Constituent Questions

a. RC meaning type: $\kappa_1$

b. Meaning type of Ty2 transform: $(s \to e) \to (s \to t) \to (s \to t)$ (function
from individual concepts to properties of propositions).

c. Type of Ty2 extension: $(s \to e) \to (s \to t) \to t$ (function from individual
concepts to sets of propositions). Technically, the curried version of the
characteristic function of a certain binary relation between individual
concepts and propositions.

d. Example: at $w$, who walks denotes the (functional) binary relation
between individual concepts $x$ and propositions $p$ that obtains just in case
$x$ is a $w$-person and $p$ is whichever proposition is a $w$-fact, that $x$
walks or that $x$ does not walk.

(42) Types for Binary Constituent Questions

a. RC meaning type: $\kappa_2$

b. Meaning type of Ty2 transform: $(s \to e) \to (s \to e) \to (s \to t) \to (s \to t)$
(curried function from pairs of individual concepts to properties of
propositions).

22 A set of propositions can then be recovered as the range of this function. This set 
differs from the Karttunen semantics in having both positive and negative ‘atomic 
answers’ as members. Additionally, our interrogative meanings yield a refinement 
of the Groenendijk-Stokhof partition semantics by taking the induced equivalence 
relation on worlds. See (Pollard 2008c) for detailed discussion.

c. Type of Ty2 extension: $(s \to e) \to (s \to e) \to (s \to t) \to t$ (curried
function from pairs of individual concepts to sets of propositions).
Technically, the curried version of the characteristic function of a
certain ternary relation between individual concepts, individual concepts,
and propositions.

d. Example: at $w$, who likes what denotes the (functional) ternary relation
between individual concepts $x$ and $y$ and propositions $p$ that obtains just
in case $x$ is a $w$-person, $y$ is a $w$-thing, and $p$ is whichever proposition
is a $w$-fact, that $x$ likes $y$ or that $x$ does not like $y$.

The fact that not all questions have the same type complicates the analysis of
in-situ multiple constituent questions as compared with the analysis of multiple
quantifier retrieval. For example, scoping one in-situ wh-operator at a
proposition produces a unary constituent question, so its type must be $\iota_\pi^{\kappa_1}$. Thus, if we
want to scope a second in-situ wh-operator over that unary constituent question
to form a binary constituent question, then its type must be $\iota_{\kappa_1}^{\kappa_2}$, and so forth. So
unlike QNPs, wh-operators must be (in principle infinitely) polymorphic. Note
that this polymorphism has nothing to do with the depth of embedding of the
sentences at which the operator is retrieved, but only with the operator’s scoping
order (in the sequence of all the wh-operators scoped within a given sentence).

Our analysis will make use of a number of Ty2 logical constants, defined by 
the following meaning postulates: 

(43) Ty2 Meaning Postulates for Some Useful Logical Constants 

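a. $\vdash \mathrm{id}_n = \lambda_Z\, Z$ (for $Z : \tau(\kappa_n)$)

b. $\vdash \mathrm{and}' = \lambda_p \lambda_q \lambda_w\, (p(w) \wedge q(w))$

c. $\vdash \mathrm{or}' = \lambda_p \lambda_q \lambda_w\, (p(w) \vee q(w))$

d. $\vdash \mathrm{not}' = \lambda_p \lambda_w\, \neg p(w)$

e. $\vdash \mathrm{equals}'_A = \lambda_x \lambda_y \lambda_w\, (x = y)$

f. $\vdash \mathrm{whether}' = \lambda_q \lambda_p\, (p \wedge ((p\ \mathrm{equals}'\ q) \vee (p\ \mathrm{equals}'\ \mathrm{not}'(q))))$

g. $\vdash \mathrm{which}^0 = \lambda_Q \lambda_P \lambda_x \lambda_p\, (Q(x)\ \mathrm{and}'\ \mathrm{whether}'(P(x))(p))$

h. $\vdash \mathrm{which}^n = \lambda_Q \lambda_Z \lambda_{x_0} \ldots \lambda_{x_n} \lambda_p\, (Q(x_0)\ \mathrm{and}'\ Z(x_0) \cdots (x_n)(p))$ ($n > 0$)

The last two are the Ty2 meanings of the interrogative determiner which. We do
not include determiners in this fragment, but these meanings are used to define
the following nonlogical constants:

(44) Ty2 Meaning Postulates for some Nonlogical Constants

For $n \in \omega$:

a. $\vdash \mathrm{who}^n = \mathrm{which}^n(\mathrm{person}')$

b. $\vdash \mathrm{what}^n = \mathrm{which}^n(\mathrm{thing}')$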

6 Chinese Interrogatives

We turn now to the analysis of so-called covert wh-movement in Chinese.²³

Our Chinese fragment uses the same types, categories, and (semantic,
syntactic, and interface) rule schemata as the English, but a different lexicon:

(45) Lexicon for Chinese Fragment 

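$\vdash \text{Zhangsan}, \mathrm{Zhangsan}' : \mathrm{NP}, \iota \dashv$

$\vdash \text{xihuan}, \mathrm{like}' : \mathrm{NP} \multimap_C \mathrm{NP} \multimap_S \mathrm{S}, \iota \to \iota \to \pi \dashv$

$\vdash \text{xi-bu-xihuan}, \mathrm{like?}' : \mathrm{NP} \multimap_C \mathrm{NP} \multimap_S \mathrm{S}, \iota \to \iota \to \kappa \dashv$

$\vdash \text{xiang-zhidao}, \mathrm{wonder}'_n : \mathrm{S} \multimap_C \mathrm{NP} \multimap_S \mathrm{S}, \kappa_n \to \iota \to \pi \dashv$

$\vdash \text{shei}, \mathrm{who}^0 : \mathrm{NP}, \iota_\pi^{\kappa_1} \dashv$

$\vdash \text{shei}, \mathrm{who}^n : \mathrm{NP}, \iota_{\kappa_n}^{\kappa_{n+1}} \dashv$ (for $n > 0$)

$\vdash \text{shenme}, \mathrm{what}^0 : \mathrm{NP}, \iota_\pi^{\kappa_1} \dashv$

$\vdash \text{shenme}, \mathrm{what}^n : \mathrm{NP}, \iota_{\kappa_n}^{\kappa_{n+1}} \dashv$ (for $n > 0$)

(46) Meaning Postulate for an Interrogative Verb Meaning
$\vdash \mathrm{like?}' = \lambda_y \lambda_x\, \mathrm{whether}'(\mathrm{like}'(y)(x))$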

Note that xi-bu-xihuan ‘like?’ is a partial-reduplicative interrogative verb form,
used for forming (both root and embedded) polar questions. The verb
xiang-zhidao ‘wonder’ has to be type-schematized according to the type of question
expressed by the sentential complement. And the sh-interrogative words have
to be type-schematized according to their scope type (and corresponding result
type). This fragment produces analyses such as the following:

(47) A Simple Chinese Sentence 

a. Zhangsan xihuan Lisi. 

b. Zhangsan like Lisi 

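c. ‘Zhangsan likes Lisi.’

d. $\vdash ({}^S \text{Zhangsan}\ (\text{xihuan}\ \text{Lisi}^C)) : \mathrm{S}$

e. Ty2: $\vdash \mathrm{like}'(\mathrm{Lisi}')(\mathrm{Zhangsan}') : \tau(\pi)$

(48) A Chinese Polar Question

a. Zhangsan xi-bu-xihuan Lisi?

b. Zhangsan like? Lisi

c. ‘Does Zhangsan like Lisi?’

d. $\vdash ({}^S \text{Zhangsan}\ (\text{xi-bu-xihuan}\ \text{Lisi}^C)) : \mathrm{S}$

e. Ty2: $\vdash \mathrm{whether}'(\mathrm{like}'(\mathrm{Lisi}')(\mathrm{Zhangsan}')) : \tau(\kappa_0)$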

23 The analysis we will propose here improves on an earlier version (Pollard 2007a,
2007b) which required construction-specific rules for different in-situ operators.

(49) A Chinese Unary Constituent Question

a. Zhangsan xihuan shenme?

b. Zhangsan like what

c. ‘What does Zhangsan like?’

d. $\vdash ({}^S \text{Zhangsan}\ (\text{xihuan}\ \text{shenme}^C)) : \mathrm{S}$

e. RC: $\vdash (\mathrm{what}^0_y\ ((\mathrm{like}'\ y)\ \mathrm{Zhangsan}')) : \kappa_1 \dashv$


(50) A Chinese Binary Constituent Question 

a. Shei xihuan shenme? 

b. who like what 

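c. ‘Who likes what?’

d. $\vdash ({}^S \text{Shei}\ (\text{xihuan}\ \text{shenme}^C)) : \mathrm{S}$

e. RC: $\vdash (\mathrm{who}^1_x\ (\mathrm{what}^0_y\ ((\mathrm{like}'\ y)\ x))) : \kappa_2 \dashv$

f. RC: $\vdash (\mathrm{what}^1_y\ (\mathrm{who}^0_x\ ((\mathrm{like}'\ y)\ x))) : \kappa_2 \dashv$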

The ambiguity is inessential: the two functions are the same modulo
permutation of their arguments.


Finally, we consider so-called Baker-type ambiguities. Baker (1970) noticed
that English sentences like the following are ambiguous:



(51) Baker-Type Ambiguity in English 

a. A: Who knows where we bought what? 

b. B: Chris does. (Appropriate when what scopes to the embedded question.)

c. B: Chris knows where we bought the books, and Kim knows where
we bought the records. (Appropriate when what scopes to the root
question.)

d. The ‘overtly moved’ wh-expressions must scope at their ‘surface’
positions: who can only scope to the root question, and where can only scope
to the embedded question.

e. But the in-situ wh-expression what can scope high or low.


A full account of this phenomenon in English depends on an analysis of overt
movement, which is beyond the scope of this paper (but see (Pollard 2008a)).
Instead, we analyze the corresponding facts of Chinese, which involve only covert 
movement. 


(52) A Chinese Baker-Type Wh-Scope Ambiguity 

a. Zhangsan xiang-zhidao shei xihuan shenme./? 

b. Zhangsan wonder who like what 

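c. $\vdash ({}^S \text{Zhangsan}\ (\text{xiang-zhidao}\ ({}^S \text{shei}\ (\text{xihuan}\ \text{shenme}^C))^C)) : \mathrm{S}$

d. $\vdash ((\mathrm{wonder}'_2\ (\mathrm{who}^1_x\ (\mathrm{what}^0_y\ ((\mathrm{like}'\ y)\ x))))\ \mathrm{Zhangsan}') : \pi \dashv$
‘Zhangsan wonders who likes what.’

e. $\vdash (\mathrm{who}^0_x\ ((\mathrm{wonder}'_1\ (\mathrm{what}^0_y\ ((\mathrm{like}'\ y)\ x)))\ \mathrm{Zhangsan}')) : \kappa_1 \dashv$
‘Who does Zhangsan wonder what (that person) likes?’

f. $\vdash (\mathrm{what}^0_y\ ((\mathrm{wonder}'_1\ (\mathrm{who}^0_x\ ((\mathrm{like}'\ y)\ x)))\ \mathrm{Zhangsan}')) : \kappa_1 \dashv$

‘What does Zhangsan wonder who likes?’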

(53) The Gist of the Preceding 

a. Both sh-expressions are in situ, so they can each scope high or low.

b. If both scope low (52d), then the root sentence expresses a proposition
and the embedded sentence expresses a binary question.

c. If one scopes high and the other low (52e, 52f), then the root sentence
and the embedded sentence both express unary questions.

d. But they cannot both scope high, since then the complement sentence 
would express a proposition, while the first argument of wonder’ must 
be a question. 
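
The reasoning in (53) can be checked mechanically by enumerating scope choices and filtering by type. The sketch below makes simplifying assumptions that go beyond the text: each clause is a proposition before any retrieval, scoping n in-situ wh-operators at a clause yields type $\kappa_n$, and the only constraint modelled is that the complement of xiang-zhidao must be a question. The function names are hypothetical.

```python
from itertools import product

def clause_type(n_scoped):
    """Type of a clause at which n wh-operators have been retrieved:
    pi if none, otherwise kappa_n."""
    return "pi" if n_scoped == 0 else f"kappa_{n_scoped}"

def baker_readings(wh_words):
    """Enumerate low/high scope choices for in-situ wh-words in the
    complement of 'wonder', keeping only the type-correct ones."""
    readings = []
    for choice in product(("low", "high"), repeat=len(wh_words)):
        n_low = choice.count("low")
        if n_low == 0:
            # The complement would be a proposition, but wonder'_n
            # needs a question (type kappa_n) as its first argument.
            continue
        embedded = clause_type(n_low)
        root = clause_type(choice.count("high"))
        readings.append((dict(zip(wh_words, choice)), embedded, root))
    return readings

readings = baker_readings(["shei", "shenme"])
for scopes, embedded, root in readings:
    print(scopes, "embedded:", embedded, "root:", root)
```

For the two wh-words of (52) this leaves three readings: both low (embedded binary question, root proposition) and the two mixed choices (unary questions at both levels), with the both-high choice filtered out.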

7 Conclusion 

We have presented a new, simple, and formally precise account of so-called covert
movement phenomena. The key ideas of the account are these:

(54) The Key Ideas Summarized 

— As in CG, both the syntax and the semantics of a linguistic expression 
are proofs. 

— But unlike mainstream CG, the syntax-semantics interface is not a func¬ 
tion, so operator-scope ambiguities need not have syntactic reflections. 

— Thus the syntax is simple. 

— And unlike TG, the interface is not a nondeterministic process made 
up of sequences of structural operations on trees. 

— Instead, it is just a recursive specification of which proof pairs go
together (parallel derivational architecture).

— The key insights embodied in the semantic logic RC go back to the
1970s: Cooper’s storage and retrieval.

— The RC formulation generalizes Carpenter’s ND rules for Moortgat’s
⇑, but only in the semantic logic (not the syntactic one).

— The transform from RC to TLC is simple. 24 


24 It would be instructive to understand the connection between this transform and
ones employed in many recent CG approaches (e.g. de Groote 2001a; Barker 2002;
Shan 2002, 2004; Moortgat 2007; Bernardi and Moortgat 2007) based on CPS
transforms (Plotkin 1975; Felleisen 1988; Danvy and Filinski 1990; Parigot 1992, 2000;
Curien and Herbelin 2000).
A number of issues remain to be addressed. For one thing, the relationship
between covert and overt movement needs to be clarified. Some preliminary steps
in this direction are taken in (Pollard 2008b, 2008d). In essence, the approach
taken there is to reconstruct the analysis of overt movement in Gazdar (1981),
using (abstract) syntactic operators paired with operator meanings of the
same general character as those that occur in the co-context. Such syntactic
operators bind a syntactic variable (‘trace of overt movement’) in a sentence in
much the same way that a quantifier retrieved from the co-store binds a semantic
variable in a proposition, except that rather than being retrieved, it is just an
ordinary logical premiss.

Second, it remains unclear how ultimately to make sense of the co-store, and
the storage and retrieval mechanisms, in logical (or categorical) terms. In this
connection, de Groote et al. (2009) show that the analysis of covert movement
set forth above can be assimilated to the CVG analysis of overt movement
just mentioned, provided we analyze an in situ operator as an ordinary premiss
with an operator type, which, when applied to its ‘gappy’ sentential argument,
in effect lowers itself into the trace position via β-reduction. 25 In other words,
a CVG with co-store can be algorithmically converted into an ordinary multimodal
categorial grammar without co-store, with CVG derivations being globally
transformed into ordinary proofs that make no use of storage or retrieval.

This state of affairs is vaguely analogous to CPS transforms that map programs
with control operators into pure functional programs. But what is missing
is a convincing logical or categorical characterization of the CVG-to-CG transform.
In the absence of such a characterization, perhaps the best face we can
put onto the storage-and-retrieval machinery is that it provides a kind of
syntactic sugar for linguists with a taste for surface-oriented syntax. To put a
more positive spin on it, the de Groote et al. transform can be taken as establishing
that Cooper-style storage-and-retrieval machinery actually has a precise
meaning (which is given by the transform).

A third, and potentially more serious, challenge for the framework presented
above is the existence of the linguistic phenomenon of parasitic scope discussed
by Barker (2007). This has to do with seemingly quantificational expressions
whose scope, as Barker puts it, ‘depends on the scope of some other scope-taking
element in the sentence’. For example, in the following sentences


(55) 

a. Anna and Bill read the same book. 

b. John hit and killed the same man. 


the interpretations of the NPs introduced by the same depend on the interpretations
of the coordinate expressions Anna and Bill and hit and killed. Barker
argues persuasively that such phenomena resist coherent analysis under familiar
approaches to quantifier scope, and offers a type-logical analysis that makes

25 This analysis is broadly similar to Oehrle’s (1994) simulation of Montagovian
‘quantifying in’, except that on Oehrle’s approach, the β-reduction that effects the
‘lowering’ is in the concrete (rather than the abstract) syntax.
use of both continuations and choice functions. Ongoing investigation of parasitic
scope (broadly construed to include similar phenomena such as remnant
comparatives and internal readings of superlatives) suggests that, although neither
continuations nor choice functions are required for the analysis of parasitic
scope, a convincing characterization of such constructions in terms of
storage-and-retrieval is simply not available. 26 If so, then it may well be that,
after 35 years of yeoman service, storage-and-retrieval technology is overdue for
retirement.
Abstract. This paper explores some of the connections between minimalist
grammars and categorial grammars in the tradition of the Lambek
calculus and its various extensions. It provides a new graphical perspective
on the lexical items of both minimalist and categorial grammars
and uses this perspective to suggest a reconciliation between two different
ways of embedding minimalist grammars into categorial grammars,
while keeping the good properties of both.


1 Introduction 

Since its introduction, it has been clear that the minimalist program [1] uses ideas
close to resource-sensitive grammar formalisms such as (extensions of) the Lambek
calculus [2]. This is especially clear in Stabler’s formalization of the minimalist
program [3]. The links between categorial and minimalist grammars have
been widely investigated [4,5,6,7,8], with the goal to ‘rebuild the minimalist
program on a logical ground’ 0. 

Minimalist grammars are a lexicalized formalism with two operations: a tree 
building operation merge and a tree reconfiguration operation move. While there 
is little divergence between different authors on how to model the merge operation, 
there are (at least) two different ways of modeling move, exemplified by [3] and [5].

The goal of this paper is twofold. Firstly, I will introduce a graph-based presentation
of proofs for (extensions of) the non-associative Lambek calculus NL.
These proof nets — besides being useful as a computational device 0GED] — also 
make the correspondence between minimalist and categorial grammars visually 
clear. Secondly, proof nets will provide a simple path to a unified treatment of 
the move operation. As a surprising set of supporting data for this unified treatment,
I will show how this unified approach is in fact very close to the one used
for large-coverage categorial grammars. 


2 Merge and AB Grammars 


This section introduces the basics: AB grammars, minimalist grammars and the 
substitution and merge operations in addition to several useful theoretical notions. 
2.1 AB Grammars

AB grammars, named after the pioneering work of Ajdukiewicz and Bar-Hillel 
mm, can be seen as a restriction on the formulas of the Lambek calculus in 
the sense made clear by the following definition. 

Definition 1. Given a set of atomic formulas A the formulas of AB grammars
are defined as follows.


𝒩 ::= A | 𝒩 / 𝒫 | 𝒫 \ 𝒩
𝒫 ::= A | 𝒫 • 𝒫

The definition distinguishes between negative formulas 𝒩 and positive formulas
𝒫. The left and right implication ‘\’ and ‘/’ can only be used on negative
(sub)formulas whereas the conjunction or product ‘•’ can only be used on positive
(sub)formulas.

Definition 2. For the set of negative formulas 𝒩 the set of antecedent trees 𝒯
is defined as follows.


𝒯 ::= 𝒩 | (𝒯, 𝒯)

That is to say, an antecedent tree is a (non-empty) binary branching tree with
negative formulas as its leaves.

Definition 3. A sequent is a pair (𝒯, 𝒫), written 𝒯 ⊢ 𝒫, where 𝒯 is an
antecedent tree and 𝒫 is a positive formula.
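The polarity restrictions of Definition 1 are easy to state as a recursive check. The following is a minimal sketch; the class names `Atom`, `Over`, `Under` and `Prod` are illustrative choices, not notation from the paper.

```python
from dataclasses import dataclass
from typing import Union

@dataclass(frozen=True)
class Atom:
    name: str

@dataclass(frozen=True)
class Over:            # B / A: yields B after taking an A to its right
    result: "Formula"
    arg: "Formula"

@dataclass(frozen=True)
class Under:           # A \ B: yields B after taking an A to its left
    arg: "Formula"
    result: "Formula"

@dataclass(frozen=True)
class Prod:            # A • B
    left: "Formula"
    right: "Formula"

Formula = Union[Atom, Over, Under, Prod]

def is_positive(f) -> bool:
    """Positive formulas: atoms and products of positive formulas."""
    if isinstance(f, Atom):
        return True
    if isinstance(f, Prod):
        return is_positive(f.left) and is_positive(f.right)
    return False

def is_negative(f) -> bool:
    """Negative formulas: atoms, N / P and P \\ N."""
    if isinstance(f, Atom):
        return True
    if isinstance(f, Over):
        return is_negative(f.result) and is_positive(f.arg)
    if isinstance(f, Under):
        return is_positive(f.arg) and is_negative(f.result)
    return False

d, v = Atom("d"), Atom("v")
speaks = Over(Under(d, v), d)          # (d\v) / d
print(is_negative(speaks))             # a legal lexical formula of an AB grammar
print(is_negative(Over(v, Under(d, v))))  # v / (d\v) has a complex argument
```

The second formula is rejected because its argument d\v is an implication in a positive position, which AB grammars do not allow.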

Table 1 lists the rules for AB: the axiom rule [Ax] states that the antecedent
tree containing just a single formula A as its leaf is a tree of type A. The rule
[•I] allows us to combine two arbitrary trees of type A and B into a tree of type
A • B.

The rule [\E] states that whenever we have shown an antecedent tree Δ to be
of type A\B and an antecedent tree Γ to be of type A then we can construct a
complex tree (Γ, Δ) of type B. In other words an antecedent tree of type A\B
combines with an antecedent tree of type A to its left to form an antecedent
tree of type B. Symmetrically, the rule [/E] allows an antecedent tree of type
B / A to combine with an antecedent tree of type A to its right to form an
antecedent tree of type B.


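Table 1. Natural deduction rules for AB grammars

$$
\frac{}{A \vdash A}\;[Ax]
\qquad
\frac{\Gamma \vdash A \quad \Delta \vdash B}{(\Gamma,\Delta) \vdash A \bullet B}\;[\bullet I]
$$

$$
\frac{\Gamma \vdash A \quad \Delta \vdash A \backslash B}{(\Gamma,\Delta) \vdash B}\;[\backslash E]
\qquad
\frac{\Delta \vdash B / A \quad \Gamma \vdash A}{(\Delta,\Gamma) \vdash B}\;[/E]
$$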
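The elimination rules can be read as a bottom-up procedure that computes the type of an antecedent tree. The sketch below covers [Ax], [\E] and [/E] only (it leaves out [•I]) and encodes formulas as nested tuples; this encoding and the function names are assumptions made for illustration.

```python
def over(b, a):
    """B / A"""
    return ("/", b, a)

def under(a, b):
    """A \\ B"""
    return ("\\", a, b)

def is_formula(x):
    # Atoms are strings, complex formulas are 3-tuples; tree nodes are 2-tuples.
    return isinstance(x, str) or (isinstance(x, tuple) and len(x) == 3)

def derive(tree):
    """Return the formula derived for an antecedent tree, or None if there is none."""
    if is_formula(tree):                       # [Ax]: a single leaf has its own type
        return tree
    left, right = tree
    l, r = derive(left), derive(right)
    if l is None or r is None:
        return None
    if isinstance(l, tuple) and l[0] == "/" and l[2] == r:
        return l[1]                            # [/E]: B / A followed by A gives B
    if isinstance(r, tuple) and r[0] == "\\" and r[1] == l:
        return r[2]                            # [\E]: A followed by A \ B gives B
    return None

speaks = over(under("d", "v"), "d")            # (d\v) / d
many = over("d", "n")                          # d / n
print(derive(("d", (speaks, (many, "n")))))    # Tama (speaks (many languages))
```

A tree such as `("d", "d")` yields `None`, since neither elimination rule applies to two determiner phrases.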
2.2 Trees and Substitution

The rules for the connectives are perhaps best seen as tree construction rules.
Figure 1 shows the rules for the connectives of Table 1 in tree form.



Fig. 1. The rules of AB as tree construction rules 


Given a list of formulas, we can determine by looking at just the form of the 
formulas which rules we need to apply in order to form a correct derivation. 
This is especially clear in the tree form: we recursively decompose each of the 
formulas until we reach the atoms. 

Figure 2 shows this pre-compilation of the rules for the sequence of formulas

d, (d\v) / d, d / n, n

An example of lexical assignments which would correspond to this sequence 
is shown inside of the square boxes. All trees have a negative atomic formula as 
their root and all leaves are positive atomic formulas — with the exception of 
the lexical formula which is negative and can be complex. 
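The unfolding of a lexical formula can be sketched as a loop that peels off one implication at a time. The tuple encoding of formulas and the `Hole` class for positive atomic leaves are assumptions of this sketch, and only atomic arguments are handled.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Hole:
    """A positive atomic leaf: an argument that still has to be supplied."""
    atom: str

def unfold(formula, word):
    """Unfold a negative AB formula into a pair (root atom, tree).

    Formulas are atoms (str), ("/", B, A) for B / A, or ("\\", A, B) for A \\ B.
    """
    tree = word
    while not isinstance(formula, str):
        op, x, y = formula
        if op == "/":                      # B / A: the argument A is a leaf to the right
            tree, formula = (tree, Hole(y)), x
        else:                              # A \ B: the argument A is a leaf to the left
            tree, formula = (Hole(x), tree), y
    return formula, tree

print(unfold(("/", ("\\", "d", "v"), "d"), "speaks"))
print(unfold(("/", "d", "n"), "many"))
```

For ‘speaks’ the result has root v, an open d leaf on each side, and the word itself as the only lexical leaf.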


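[Figure 2: unfolded lexical trees for Tama : d, speaks : (d\v) / d, many : d / n and languages : n. The corresponding minimalist feature lists shown beneath the trees are d Tama, =d =d v speaks, =n d many and n languages.]

Fig. 2. Lexical Unfolding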
Fig. 3. Axiom/Cut as Substitution

In order to use these trees as a system for deriving sentences, we only need
to add a single rule: the substitution rule, shown in Fig. 3, which allows us
to substitute a tree Δ with root A for a leaf A in a tree Γ. In other words, it
identifies a positive and a negative occurrence of the same atomic formula.

This is exactly the substitution rule from tree adjoining grammars US] and 
from the view of natural deduction, it corresponds to the possibility of substituting
a proof of Δ ⊢ A for the axiom A ⊢ A in a proof of Γ[A] ⊢ C to obtain a
proof of Γ[Δ] ⊢ C.



Fig. 4. Result after two d substitutions and one n substitution 

Figure 4 shows how one application of the substitution rule for the n atomic
formula and two applications of the substitution rule for the d formula produce
a tree of type v with ‘Tama speaks many languages’ as its yield.
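The three substitutions can be replayed on the unfolded trees. This is a minimal sketch in which the trees are written out by hand and `Hole`, `substitute` and `words` are illustrative names; always filling the leftmost matching leaf is a simplification chosen for this example.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Hole:
    atom: str                     # positive atomic leaf waiting for a tree with this root

def substitute(tree, root, sub):
    """Replace the leftmost Hole(root) in `tree` by `sub`; return (new_tree, done)."""
    if isinstance(tree, Hole):
        return (sub, True) if tree.atom == root else (tree, False)
    if isinstance(tree, tuple):
        left, right = tree
        new_left, done = substitute(left, root, sub)
        if done:
            return (new_left, right), True
        new_right, done = substitute(right, root, sub)
        return (left, new_right), done
    return tree, False            # a word: nothing to substitute

def words(tree):
    """Yield of a tree, with open leaves shown as <atom>."""
    if isinstance(tree, Hole):
        return [f"<{tree.atom}>"]
    if isinstance(tree, tuple):
        return words(tree[0]) + words(tree[1])
    return [tree]

speaks = (Hole("d"), ("speaks", Hole("d")))     # root v
many = ("many", Hole("n"))                      # root d

dp, _ = substitute(many, "n", "languages")      # one n substitution
step, _ = substitute(speaks, "d", "Tama")       # first d substitution (subject)
sentence, _ = substitute(step, "d", dp)         # second d substitution (object)
print(" ".join(words(sentence)))
```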
2.3 Merge

In Stabler’s formalization of Chomsky’s minimalist grammars DEI, grammars 
have a set of features: syntactic features, which play an active role in the derivations
of the formalism, and non-syntactic features, which are the phonetic symbols
(and semantic symbols, but in the current paper these will not be treated at the
same level as the phonetic features) in the lexicon, that is, the words and morphemes
of the language.

Given a set of base syntactic categories B , typically containing categories like n 
for nouns, d for determiner phrases, v for verb phrases and c for complementizers, 
the selectors for these base categories, written =A for a base category A , select 
a base category of type A to form a complex tree. 

The lexicon assigns lists of features to words and morphemes, possibly including
the empty string. The grammatical functions merge and move construct
and rearrange trees by eliminating matched pairs of features from the first positions
of their respective lists. Throughout this paper, I will use A to denote the
first feature on the feature list (the head or car, depending on your favorite
programming language) and Bs to denote the rest of the features (the
tail or cdr). In addition, I will often differentiate among the different possibilities
for A, using =A for the first element of a feature list which has the additional
requirement that it is a selector feature.

The function merge allows a lexical list of features with first feature =A to
combine with a tree with principal feature A. We distinguish two cases: in case
=A is a lexical feature, it combines with a (possibly complex) tree Γ of category
A to its right. In case =A is derived, it combines with a tree Γ of category A to
its left. Figure 5 shows the two cases in tree form. 1 The similarity with Fig. 1
should be clear, but let us note the differences explicitly. First of all, there is
no equivalent of the [•I] rule in minimalist grammars. More importantly, merge
imposes a clear order on the selection of the arguments: the first argument needs
to be selected to the right whereas any additional arguments need to be selected to
the left.
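The two cases of merge can be sketched directly on feature lists. The `Expr` class and the error handling are assumptions of this sketch, which covers the merge-only fragment in which an argument carries nothing but its category feature.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Expr:
    features: tuple          # remaining features, e.g. ("=d", "=d", "v")
    tree: object             # a word (str) or a pair (left, right)
    lexical: bool = True     # False once the expression is the result of a merge

def merge(functor, arg):
    """Cancel =A on the functor against the category A of the argument."""
    if (not functor.features or len(arg.features) != 1
            or functor.features[0] != "=" + arg.features[0]):
        raise ValueError("merge needs =A on the functor and a bare category A on the argument")
    if functor.lexical:
        tree = (functor.tree, arg.tree)      # first argument: selected to the right
    else:
        tree = (arg.tree, functor.tree)      # later arguments: selected to the left
    return Expr(functor.features[1:], tree, lexical=False)

speaks = Expr(("=d", "=d", "v"), "speaks")
many = Expr(("=n", "d"), "many")
languages = Expr(("n",), "languages")
tama = Expr(("d",), "Tama")

dp = merge(many, languages)
vp = merge(speaks, dp)
s = merge(vp, tama)
print(s.features, s.tree)
```

The final expression has the single feature v and the same tree shape as the AB derivation of ‘Tama speaks many languages’.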

As a consequence, minimalist grammars which use merge only are — from 
the point of view of AB grammars — a restriction on the allowed formulas. 
The lexical categories of Fig. [2] satisfy the required restrictions, as shown by the 
corresponding lists of minimalist features beneath each lexical tree. 

Minimalist grammars allow the same type of lexical unfolding as we used
for AB grammars. Figure 6 shows this explicitly: the unfolding of the list of
features for ‘speaks’ produces the tree on the left, whereas the unfolding of the
corresponding AB formula produces the tree on the right. Discounting the labels
on the internal nodes and the lexical leaves, the two trees are identical.
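For the merge-only fragment, the correspondence between feature lists and AB formulas can be written as a small translation. This is a sketch under the assumption that a feature list consists of selectors followed by one category; `to_ab` and `show` are illustrative names.

```python
def to_ab(features):
    """Translate a merge-only feature list such as ["=d", "=d", "v"] into an AB formula.

    The first selector becomes an argument to the right, all later selectors
    become arguments to the left. Formulas are atoms (str), ("/", B, A) for
    B / A, or ("\\", A, B) for A \\ B.
    """
    *selectors, category = features
    formula = category
    for sel in reversed(selectors[1:]):        # later selectors: arguments to the left
        formula = ("\\", sel[1:], formula)
    if selectors:                              # first selector: argument to the right
        formula = ("/", formula, selectors[0][1:])
    return formula

def show(f, top=True):
    """Print a formula with parentheses around complex subformulas."""
    if isinstance(f, str):
        return f
    op, x, y = f
    s = f"{show(x, False)}{op}{show(y, False)}"
    return s if top else f"({s})"

for feats in (["d"], ["=d", "=d", "v"], ["=n", "d"], ["n"]):
    print(" ".join(feats), "->", show(to_ab(feats)))
```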

Since the substitution rule operates only on the positive atomic leaves and 
the negative roots of the trees, we can simplify our tree representations by 

1 I will not consider Stabler’s head movement categories here. In addition, I will fol¬ 
low ^ in not distinguishing between left-headed and right-headed structures, though 
it is easy to add this information in case of need mm show how this can be done 
in the general and in the minimalist case]. 
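[Figure 5: the two cases of merge. Lexical: a lexical head with feature list =A Bs takes a tree of category A to its right, giving a tree with features Bs. Derived: a derived tree with feature list =A Bs takes a tree of category A to its left, giving a tree with features Bs.]

Fig. 5. The minimalist merge operation

[Figure 6: on the left, the unfolding of the feature list =d =d v for ‘speaks’; on the right, the unfolding of the AB formula (d\v) / d for ‘speaks’; in the middle, the simplified representation with root v and two d leaves.]

Fig. 6. A transitive verb in MG and in AB grammars, with a simplified representation
for both in the middle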


erasing all complex formulas and non-singleton feature lists, obtaining a simplified
representation which is functionally equivalent to the fully annotated trees
and from which we can, when needed, extract all information which has
been erased.

3 Move and Extensions of Lambek Grammars 

The correspondence between merge and AB grammars of the previous section is 
very simple and direct. In order to model the minimalist move operation as well, 
we will need some more machinery. First, I will follow Lambek mm by adding 
the ‘missing’ rules for the connectives to AB. That is to say, we add the logical 
rules for negative occurrences of the conjunction and for positive occurrences of 
the implications. 

Secondly, since move rearranges a tree by moving a marked subtree to the 
front, essentially changing the word order of the tree yield, I will follow nsum 
by adding modally controlled versions of the associativity and commutativity 
rules to the calculus. 
3.1 The Non-associative Lambek Calculus

Lambek’s seminal paper [2] extends the rules of AB (Table 1) by adding the
introduction rules for the implications, which show how to use positive B / A
and A\B formulas, and the elimination rule for the product, which shows
how to use negative A • B formulas, to the calculus. In addition to giving
numerous linguistic uses of the calculus, Lambek proves basic results like cut
elimination and decidability.

The non-associative Lambek Calculus NL m is a restriction of the associative 
Lambek Calculus L. It requires all antecedents to be trees of formulas, where 
antecedents for L were simply lists of formulas. 


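Table 2. Natural deduction rules for NL grammars

$$
\frac{}{A \vdash A}\;[Ax]
$$

$$
\frac{\Delta \vdash A \bullet B \quad \Gamma[(A,B)] \vdash C}{\Gamma[\Delta] \vdash C}\;[\bullet E]
\qquad
\frac{\Gamma \vdash A \quad \Delta \vdash B}{(\Gamma,\Delta) \vdash A \bullet B}\;[\bullet I]
$$

$$
\frac{\Gamma \vdash A \quad \Delta \vdash A \backslash B}{(\Gamma,\Delta) \vdash B}\;[\backslash E]
\qquad
\frac{(A,\Gamma) \vdash B}{\Gamma \vdash A \backslash B}\;[\backslash I]
$$

$$
\frac{\Delta \vdash B / A \quad \Gamma \vdash A}{(\Delta,\Gamma) \vdash B}\;[/E]
\qquad
\frac{(\Gamma,A) \vdash B}{\Gamma \vdash B / A}\;[/I]
$$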

Definition 2 in Sect. 2.1 already uses trees of formulas as antecedents, so our
basic sequents are suitable objects for NL. Table 2 shows the natural deduction
rules for NL.

The [•E] rule is the most complicated of the new rules. Figure 7 shows the
rule in a graphical representation. On the left hand side of the figure, we have
the two premisses of the rule, a structure Δ with a negative root formula A • B
and a structure Γ with a negative root formula of type C. This structure Γ
has the additional property that it has a subtree occurrence (A, B), that is
to say a binary branch with positive leaves A and B, in that order, which is
displayed explicitly. For the conclusion of the rule, shown on the right hand side,
the subtree (A, B) has been replaced by Δ.

The structure in the middle of Fig. 7 shows an intermediate structure we will
use for computational purposes. It splits up the conversion of the structure on
the left into the structure on the right into two steps. The first step, indicated by
the first arrow, ‘unfolds’ the A • B formula into its A and B sub-formulas, while
keeping track of the position of the main formula by means of an arrow. The
new constructor, which is portrayed with a filled center and which we will call an
auxiliary constructor, connects the two structures shown on the left hand side,
Fig. 7. A graphical representation of the [•E] rule


but in such a way that the result is no longer a tree. 2 The use of ovals instead
of triangles in the figure is to remind us of the fact that the
structures Γ and Δ are no longer necessarily trees. The second step contracts an
auxiliary and a tree constructor which are connected at both ports which do not
have the arrow, eliminating the two constructors and the intermediate A and
B nodes while keeping Γ and Δ connected. The intuition behind the auxiliary
constructor and the corresponding contraction should be clear: it verifies that A
and B occur as sister leaves of a tree constructor, then erases both of them.

Figure 8 shows the [/I] rule in graphical form; the [\I] rule is left-right
symmetric. Here, the premiss of the rule is a graph (Γ, A) of type B, as shown
on the left of the figure. The conclusion of the rule is the graph Γ of type
B / A. Again, we divide the calculation into two steps: the first step consists of
decomposing the goal formula B / A into its B and A subformulas and connecting
these subformulas to the B root and A leaf of the graph on the left to produce
the structure in the middle of the figure. Then, as before, we contract the two
links which are shown in the middle of the figure to arrive at the final structure
on the right hand side. The intuition behind the [/I] contraction is then: verify
that a formula A occurs as the right daughter of a B tree, then delete both to
obtain a B / A graph.

Note that this contraction is similar to the contraction shown in Fig. 7: in
both cases an auxiliary and a tree constructor are connected at the two points
of the auxiliary constructor which do not have the arrow pointing towards them,
the two constructors and shared vertices are erased and the graph is reconnected.

2 In fact, the graphs we obtain are still quite close to trees: they are term graphs, a data
structure used, for example, to share common sub-terms in arithmetic expressions.
In this case A • B is shared by two paths from C.
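[Figure 8: on the left, a graph (Γ, A) of type B; in the middle, the same graph connected to the unfolded goal formula B / A; on the right, the graph Γ of type B / A after contraction.]

Fig. 8. A graphical representation of the [/I] rule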


Similar to the way we have seen in Sect. 2.2, we can recursively unfold a
lexical formula until we reach its atoms. This time, there will be a possibility 
for each pair of main connective and polarity, though it requires us to slightly 
generalize our structures. In addition to the constructor with an empty circle in 
the center there are now three auxiliary constructors with a filled circle. Each of 
these auxiliary constructors has a single arrow which points to the main formula 
of the link, whereas the two active formulas are not explicitly marked. 

As shown in Fig. 9, there is an auxiliary constructor for each of the connectives
and the rules for the connectives of positive polarity on the top row are the up- 
down symmetric duals of the connectives of negative polarity on the bottom 
row, with one of the pair being a tree constructor and the other an auxiliary 
constructor. 

In each of the six figures the oval Γ corresponds to the graph which has been
constructed so far, with the main formula either on top of it (for the negative
occurrences) or at its bottom (for the positive occurrences). For reference, the
inductive definition of formulas (which adds the missing cases to those of Definition 1)
is shown above the unfolding rule for the positive cases and below it for
the negative cases.

Note that after an unfolding step, the positive subformulas are always
drawn below the constructor while the negative subformulas are drawn above
it. The substitution or axiom/cut rule connects positive and negative atomic
formulas as before. The only difference is that, unlike what is suggested by
Fig. 3, the two connected structures need not be disjoint.

A graph is correct if we can contract all auxiliary constructors to form a tree. 
Valid structures correspond exactly to natural deduction proofs |S]. 

As an example, Fig. 10 shows the sentence ‘Someone speaks Yoruba’ unfolded
on the left hand side of the picture. To save space, the simple d tree ‘Yoruba’ has 
already been substituted for the object determiner phrase of ‘speaks’. As before, 
the information on the internal nodes is superfluous and can be deduced from 
the combination of the graph structure and the formulas at the external nodes. 
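[Figure 9: unfolding rules for each connective and polarity, with the positive cases on the top row and the negative cases on the bottom row. Surviving labels for the negative cases: 𝒩 ::= 𝒩 / 𝒫, 𝒩 ::= 𝒩 • 𝒩, 𝒩 ::= 𝒫 \ 𝒩.]

Fig. 9. Formula decomposition rules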


Note that since ‘someone’ has a complex positive subformula d\v, its unfolding
uses the unfolding for 𝒫 ::= 𝒩 \ 𝒫 shown on the top right of Fig. 9. Intuitively,
someone searches for a verb phrase d \ v to its right to give a v and we will verify
this by means of substitutions and contractions.

Performing the d and v substitutions gives the graph shown on the right. The
d and v nodes in the graph are connected respectively to the left and to the
top node of a tree constructor, which means we are in the proper configuration
to perform a [\I] contraction. Performing this contraction erases the part of
the graph shown in the gray box, while identifying the two d\v nodes. After
the contraction, the remaining part of the graph is a tree, showing the initial
structure to be valid.

3.2 Extensions of NL 

Before returning to minimalist grammars, I will quickly discuss the ways to 
extend NL to allow the controlled use of structural rules. The first of these is the 
use of unary control operators m- Unlike the exponentials of linear logic ESI. 
Fig. 10. Lexical Unfolding and Axioms


these unary connectives are a simple generalization of the binary ones. They
have their own constructors and contractions which are shown in Fig. 11.

The similarity with the contractions for the binary connectives becomes clear
once we see that the right hand side of Fig. 11, showing the [◇E] contraction,
is just the contraction for [•E] shown in Fig. 7 but with the A formula and
its link to the two connectors removed. The [□I] contraction on the left is just
the [/I] contraction shown in Fig. 8, again with the A formula and links to it
removed.

The principles illustrated in Fig. 11, and which we will use in what follows,
are the following:


◇□A ⊢ A        A ⊢ □◇A

The first principle states that, in a negative context such as the one shown in Fig. 11
on the left, we can contract a ◇□A structure into an A structure. In other words, ◇□A is a
sort of subtype of A with some special properties.
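Both principles follow in one step from the residuation law between ◇ and □. The derivation below is a sketch that assumes this law in its standard form for unary modalities; it is not read off Fig. 11.

```latex
\begin{align*}
\Diamond A \vdash B &\iff A \vdash \Box B
  && \text{(residuation, assumed)}\\
\Box A \vdash \Box A &\implies \Diamond\Box A \vdash A
  && \text{(right to left, with } \Box A \text{ for } A \text{ and } A \text{ for } B\text{)}\\
\Diamond A \vdash \Diamond A &\implies A \vdash \Box\Diamond A
  && \text{(left to right, with } \Diamond A \text{ for } B\text{)}
\end{align*}
```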
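[Figure 11: constructors and contractions for the unary connectives, with nodes labeled □A and ◇□A.]

Fig. 11. Constructors and Contractions for Unary Connectives

[Figure 12: structural rules for a unary connective on a right branch.]

Fig. 12. Structural Rules: Right Branch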


The special property we are interested in here is the access to structural rules
which are not globally available. This makes a formula ◇□A a formula which can
permute throughout the tree and then convert to a normal A formula. 3

In the graph representation of proofs we have been working with, structural 
rules correspond to a rewrite of one tree into another. Tree rewriting allows us 
to restructure a graph in such a way that a contraction becomes possible. A graph 


3 Compare this to a formula !A in linear logic, which can use the structural rules of
contraction and weakening, then convert to a normal A formula using the dereliction
rule !A ⊢ A.
Fig. 13. The minimalist move operation


is correct if it converts to a tree using a combination of the structural rules and 
the contractions [9]. 

The set of structural rules we use for this purpose is shown in Fig. 12. This
figure shows the possibilities for combining a unary connective with a pair of
binary connectives, while keeping the unary connective with its daughter z as
one of the branches of one of the binary constructors.

The arrows pointing inwards to the center structure move z up towards the
root. Depending on whether we are in a configuration as shown on the left or
in a configuration as shown on the right, only one of rules [P1] and [P2] applies.
The arrows pointing outwards offer a non-deterministic choice between moving
down towards subtree x or towards subtree y.

3.3 Move 

With the required machinery in place, it is time to return to minimalist grammars.
Besides the merge operation, minimalist grammars define a move operation.
In order to model this move operation, Stabler introduced licensor features
+X and licensee features −X. Licensor features ‘attract’ subtrees with a licensee
feature and move them to the licensor’s specifier position.

Figure 13 shows a graphical representation of the minimalist move operation.
In the figure, A is the largest tree of which −X is the head.

We have seen in Sect. 2.3 that a minimalist grammar with only the merge
rule naturally generates sentences in SVO order. Stabler [3] shows how the move 
operation can be used to derive SOV order instead. 

Suppose ‘speaks’, which was assigned feature list =d =d v speaks before, is
now assigned =d +k =d v speaks instead. That is to say, it still selects its object
to the right, but then ‘attracts’ the subtree with the case feature k to its left. In
case the determiner phrase ‘Maori’ is assigned the feature list d −k Maori, we
derive ‘Maori speaks’ as shown in Fig. 14. Note that combining the structure
on the right with a subject determiner phrase will produce the SOV order as
required.
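The SOV derivation can be replayed with a small sketch of merge and move. It is a simplification under stated assumptions: licensee features are written with an ASCII minus ("-k"), a subtree that still has a licensee feature waits in a `movers` list and leaves ε behind, and only one licensee feature per entry is supported. `Expr`, `merge`, `move` and `words` are illustrative names.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Expr:
    features: tuple          # remaining features of the head, e.g. ("+k", "=d", "v")
    tree: object             # a word (str) or a pair (left, right)
    movers: tuple = ()       # pending subtrees: ((licensee_features, subtree), ...)
    lexical: bool = True

def merge(functor, arg):
    """=A meets A; an argument with a licensee feature left over waits in `movers`."""
    if (not functor.features or not arg.features
            or functor.features[0] != "=" + arg.features[0]):
        raise ValueError("merge needs =A on the functor and A on the argument")
    rest = arg.features[1:]
    placed = "ε" if rest else arg.tree           # a mover leaves an empty position behind
    movers = functor.movers + arg.movers + (((rest, arg.tree),) if rest else ())
    tree = (functor.tree, placed) if functor.lexical else (placed, functor.tree)
    return Expr(functor.features[1:], tree, movers, lexical=False)

def move(expr):
    """+X attracts the unique pending subtree whose first feature is -X."""
    if not expr.features or not expr.features[0].startswith("+"):
        raise ValueError("move needs a licensor feature +X")
    wanted = "-" + expr.features[0][1:]
    hits = [m for m in expr.movers if m[0][0] == wanted]
    if len(hits) != 1:
        raise ValueError("expected exactly one subtree with " + wanted)
    feats, subtree = hits[0]
    if len(feats) > 1:
        raise ValueError("this sketch handles a single licensee feature only")
    movers = tuple(m for m in expr.movers if m is not hits[0])
    return Expr(expr.features[1:], (subtree, expr.tree), movers, lexical=False)

def words(tree):
    """Yield of a tree, skipping empty positions."""
    if isinstance(tree, tuple):
        return words(tree[0]) + words(tree[1])
    return [] if tree == "ε" else [tree]

speaks = Expr(("=d", "+k", "=d", "v"), "speaks")
maori = Expr(("d", "-k"), "Maori")
tama = Expr(("d",), "Tama")

step1 = merge(speaks, maori)     # object selected to the right, marked for movement
step2 = move(step1)              # +k attracts the -k subtree to the left
step3 = merge(step2, tama)       # subject selected to the left
print(" ".join(words(step3.tree)))
```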
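[Figure 14: on the left, the tree for ‘speaks Maori’ with feature lists +k =d v speaks and −k Maori; on the right, the tree after move, with ‘Maori’ in front, remaining features =d v speaks, and ε in the original position of ‘Maori’.]

Fig. 14. An example of move to derive SOV word order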


3.4 Different Categorial Perspectives on Move 

There are two basic approaches to implementing the move operation into categorial
grammars: the approach of Vermaat [4], who analyses move using the introduction
rules for the implications and the approach of Lecomte & Retoré [5],
who analyse move using the elimination rule for product. Both use a form of
commutativity to handle the actual movements.

Vermaat presents two ways of implementing move: the first [4; Section
4.2.2, 4.2.3] provides a translation of minimalist feature lists into categorial formulas,
but it has the disadvantages that it assigns formulas to the empty string
and that it doesn’t assign the correct semantics to the assigned structures. The
second [4; Section 4.2.4] avoids these two problems but doesn’t answer the question
of whether this method is sufficiently general to handle arbitrary minimalist grammars.
No general translation for this second implementation is offered and no
formal equivalence result between either of the two implementations and minimalist
grammars is proved.

Lecomte & Retoré’s proposal is close, in spirit if not in the actual details,
to Vermaat’s first implementation. It provides a general mapping from minimalist
feature lists into categorial formulas, but requires formula assignments
to the empty string to make the translation work. Because the translation stays
close to the definition of minimalist grammars, equivalence is relatively easy to
establish nsj. On the other hand, this closeness to minimalist grammars also 
means it requires some work to compute the correct semantics from a deriva¬ 
tion [HOED]. 

A slightly simplified version of the translation proposed by Amblard [12; Section
6.2.1] is shown below. 4

Figure 15 shows on the left how this translation produces formulas and
graphs for the lexical entries =n d -wh which and =v +wh c. These two lexical
entries allow us to treat wh extraction, for sentences such as ‘which languages
does Tama speak?’.

4 It is simplified in the sense that only one licensee feature −f can be present in a
lexical entry and that a licensee feature necessarily occurs in a type with at least
one selector feature. This suffices for the example later and is just to simplify the
presentation.
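$$
\begin{aligned}
\|{=}f\ Bs\|^{\alpha} &= \|Bs\|^{\beta} / f \\
\|f\|^{\alpha} &= f \\[4pt]
\|{=}f\ Bs\|^{\beta} &= f \backslash \|Bs\|^{\beta} \\
\|{+}f\ Bs\|^{\beta} &= f \backslash \|Bs\|^{\beta} \\[4pt]
\|f\ {-}g\|^{\beta} &= g \bullet \Diamond\Box f \\
\|f\|^{\beta} &= f
\end{aligned}
$$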

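[Figure 15: lexical graphs for three successive formula assignments, with the empty element ε of category c shown at the top left:

which : (wh • ◇□d) / n        which : ((c / v) • ◇□d) / n        which : (c / (v / ◇□d)) / n]

Fig. 15. From products and assignments to the empty string to higher order formulas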

The analysis is divided into two parts. The first part is performed by the
lexical entry ‘which’, shown on the bottom left: it selects an n to its right to
become a d which is marked for movement to a wh licensor. The second part,
performed by the empty lexical element shown on the top left, closes off a verb
domain to produce a clause c while adding a movement trigger to the left of this
empty element.

In this case, we can simplify the two lexical graphs a bit. Given that the
positive and negative wh elements can only occur together, we can combine the
two lexical graphs into a single graph. In addition this allows us to remove the
ε element from the graph. The result, with the corresponding formula, is shown
in the middle of Fig. 15.

Using a second translation step, we can specialize the sub-formula of the form
(A / C) • ◇□B into A / (C / ◇□B). The resulting formula and the corresponding
graph on the right of the figure correspond quite well to our intuitions of lexical
formula assignments for categorial grammars and have the desired semantic form
as well.
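The specialization step can be sketched as a rewrite on formula trees. The rule shape used here, (A / C) • ◇□B to A / (C / ◇□B), is read off the middle and right formulas of Fig. 15; the tuple encoding, the `"<>[]"` tag for ◇□ and the name `specialize` are assumptions of this sketch.

```python
def specialize(f):
    """Rewrite every subformula (A / C) • ◇□B into A / (C / ◇□B).

    Formulas are atoms (str), ("/", B, A), ("\\", A, B), ("*", A, B) for the
    product, and ("<>[]", A) for ◇□A.
    """
    if isinstance(f, str):
        return f
    op, *args = f
    args = [specialize(a) for a in args]         # rewrite subformulas first
    if (op == "*"
            and isinstance(args[0], tuple) and args[0][0] == "/"
            and isinstance(args[1], tuple) and args[1][0] == "<>[]"):
        _, a, c = args[0]
        return ("/", a, ("/", c, args[1]))
    return (op, *args)

# which : ((c / v) • ◇□d) / n, the formula in the middle of Fig. 15
which_mid = ("/", ("*", ("/", "c", "v"), ("<>[]", "d")), "n")
print(specialize(which_mid))
```

The result encodes (c / (v / ◇□d)) / n, the formula on the right of the figure; formulas without a matching product are returned unchanged.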

These two operations, elimination of assignments to the empty string by joining
the licensor and licensee into a single lexical entry and the formula specialization
rule, seem to form the ‘missing link’ between Vermaat’s and Lecomte &
Retoré’s implementation of minimalist grammars. They can be used as a stepping
stone both for a proof of equivalence for Vermaat’s proposal and for an easy way
of obtaining a sort of ‘higher order’ minimalist grammars as well as the correct
semantics for Lecomte & Retoré’s proposal.

4 Extracting Grammars from Corpora 

In the previous section, we have seen that the different ways of implementing 
minimalist grammars into categorial grammar are actually quite close. In this 
section, I will provide some surprising support for this claim in the form of large- 
coverage categorial grammars which are automatically extracted from corpora. 

4.1 The Spoken Dutch Corpus 

The Spoken Dutch Corpus (Corpus Gesproken Nederlands, or CGN) is an annotated
corpus of around nine million words of contemporary spoken Dutch.
A core corpus of slightly over one million words has been assigned syntactic
annotations in the form of directed acyclic graphs, where the vertices are labeled
with constituent information and the edges are labeled with dependency
relations between these constituents.

Figure 16 shows an example of such an annotated graph. Vertices are drawn
as ovals, with the syntactic category inside, and edges connect vertices by lines,
with the edge label inside a rectangle. The arrows are not explicitly indicated
but are assumed to point down, with the mother nodes portrayed above their
respective daughters.

The graph of Fig. 16 is a wh question, as indicated by the category WHQ,
which has two daughters: a noun phrase (NP) which is the head of the wh 
question (whd) and its body (body), which is a verb-initial sentence (SV1). 

The CGN annotation has no assignments to the empty string. Instead, when 
a Chomskyan analysis would require an empty ‘trace’ element, the annotation 
gives a constituent multiple dependencies. In our example, the NP ‘welke idioot’ 
is the head of the wh question (whd) as well as the subject (su) of the verb-initial 
sentence. In the graph the NP node has therefore both the WHQ and the SV1 
vertices as its parents. 
which                   idiot   ε          goes              now                     ‘something like that’   throw

((wh / s) • ◇□np) / n   n                  (s /i inf) / np   s \i s                  np                      np \ inf

                                           (s / inf) / np    (s / inf) \ (s / inf)

=n np -m                n       =s +m wh   =i =np s          =S s                    np                      =NP i

Fig. 16. Which idiot is going to throw something like that? 


4.2 Grammar Extraction 

Moortgat & Moot [22,25] propose a parametric algorithm for extracting categorial
lexicons from these annotation graphs. I will only briefly sketch the way
this algorithm transforms the annotation graphs into a categorial lexicon, using
the preceding graph as an example.

The SV1 node has four daughters: a subject (su), which is the extracted noun 
phrase, a head (hd), which is the main verb of the sentence, a sentence modifier 
(mod), which is the adverb ‘nou’ and finally a verbal complement (vc) which is 
an infinitival group (INF). 

The categorial formulas corresponding to the vertex label are a parameter to 
the algorithm. Here we assume SV1 is translated into s, NP is translated into 
np and INF is translated into inf. Other choices are possible: for example, we 
could use a complex np\ s category for the infinitive. 

Another parameter of the extraction algorithm is the set of dependency relations
which correspond to modifiers. Typically, this includes (mod) but some
syntactic categories have additional dependency roles which are, from a categorial
point of view, best analyzed as modifiers. As usual in categorial grammars,
modifiers are assigned formulas of the form X / X or X \ X. There is a single
modifier daughter, which is assigned the category s \i s, indicating it modifies
the sentence but does so using a special ‘infixation mode’ i. We can avoid using
different modes here by assigning the category (s / inf) \ (s / inf) to ‘nou’ instead,
but this would result in a considerable increase in the number of lexical possibilities
for adverbs. There is a trade-off to be made between lexical ambiguity and
additional structural rules here which is beyond the scope of the current article.

A final parameter identifies the functor of each constituent by its dependency 
relation. This is typically the head of the constituent, as in the current case. All 
daughters of a syntactic category which are neither functor nor modifier, the 
subject and verbal complement in this case, are arguments. A functor F selects 
its argument A and marks whether it occurs on its left or on its right by the 
direction of the implication used: F / A if the argument occurs to the right and 
A \ F if the argument occurs to the left. 

Though the position of the NP in the annotation graph is to the left of the
verb, the subject is considered to be directly on its right instead: the canonical
position of a subject in a verb-initial phrase is directly after the verb, e.g. ‘gaat Piet
nou zoiets gooien?’. This is an explicit use of linguistic information in order to
obtain structures which are as close as possible to manually assigned categories.

To obtain the category for the verb, we integrate the argument daughters
from right to left, obtaining first s / inf after the infinitive daughter and then
(s / inf) / np as the result category for the verb.
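
The integration step can be illustrated with a small Python sketch. It is not the extraction algorithm of Moortgat & Moot itself: the function names `wrap` and `functor_category`, and the encoding of categories as strings, are assumptions made only for this illustration.

```python
def wrap(cat):
    # Parenthesise complex categories so the result stays unambiguous.
    return f"({cat})" if any(op in cat for op in "/\\") else cat


def functor_category(result, args):
    """Build a functor category from its result category and its arguments.

    `args` lists (category, side) pairs in surface order, where `side` says
    whether the argument occurs to the 'right' or to the 'left' of the functor.
    Arguments are integrated from right to left, as described in the text.
    """
    cat = result
    for arg, side in reversed(args):
        if side == "right":
            cat = f"{wrap(cat)} / {wrap(arg)}"   # F / A: argument on the right
        else:
            cat = f"{wrap(arg)} \\ {wrap(cat)}"  # A \ F: argument on the left
    return cat


# The verb 'gaat': subject np (canonically directly on its right), then inf.
print(functor_category("s", [("np", "right"), ("inf", "right")]))  # (s / inf) / np
```

With the subject treated as the first argument on the right, the sketch returns the category given above for the verb; a left argument, as for the infinitive 'gooien', yields a backslash category.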

All of this produces a (possibly multimodal) AB grammar. However, we
haven’t dealt with multiple dependencies yet. If a constituent plays several roles
in a phrase, we need to assign it a formula for each of these roles. The simplest
and most general way of encoding this is to use the product of the different
categories. However, since typically only one of these roles is a local one, we must
allow for the additional roles to be played in other constituents. This means the
appropriate formula for a constituent which is locally of category A and which
is of category B elsewhere would be A • ◇□B, where ◇□B is a B formula with
a permute modality.

This gives the NP constituent the type (wh / s) • ◇□np. This is a slightly
unusual formula for a wh element in a categorial grammar. However, it is exactly
the type of formula we saw for the translation of the minimalist grammar in
Sect. 3.4 and Fig. 15 and we translate it in the same way to wh / (s / ◇□np).
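
The specialization of a product formula into a higher order formula can be sketched as a rewrite on formula trees. The following Python fragment is only an illustration under stated assumptions: the class names `Atom`, `Over`, `Prod` and `Perm` and the functions `specialize` and `show` are hypothetical, and only the rightward slash is modelled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Atom:
    name: str


@dataclass(frozen=True)
class Over:          # res / arg
    res: object
    arg: object


@dataclass(frozen=True)
class Prod:          # left • right
    left: object
    right: object


@dataclass(frozen=True)
class Perm:          # ◇□body, a formula with a permute modality
    body: object


def specialize(f):
    """Rewrite every (A / C) • ◇□B into A / (C / ◇□B)."""
    if isinstance(f, Prod) and isinstance(f.left, Over) and isinstance(f.right, Perm):
        return Over(specialize(f.left.res), Over(specialize(f.left.arg), f.right))
    if isinstance(f, Over):
        return Over(specialize(f.res), specialize(f.arg))
    if isinstance(f, Prod):
        return Prod(specialize(f.left), specialize(f.right))
    return f


def show(f):
    if isinstance(f, Atom):
        return f.name
    if isinstance(f, Perm):
        return "◇□" + show(f.body)
    if isinstance(f, Over):
        return f"({show(f.res)} / {show(f.arg)})"
    return f"({show(f.left)} • {show(f.right)})"


wh_np = Prod(Over(Atom("wh"), Atom("s")), Perm(Atom("np")))
print(show(specialize(wh_np)))  # (wh / (s / ◇□np))
```

Because the rewrite is applied recursively, the same function also handles the formula for ‘which’, where the product occurs under an outer slash.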

Continuing the formula assignment to the remaining vertices, we get the lexical
results shown in the penultimate line of Fig. 16. The unimodal possibilities
for ‘gaat’ and ‘nou’, the only types requiring a multimodal annotation in order
to reduce the number of lexical entries for the adverb, are shown below the other
types.

The closeness of the extracted types in this section to the translated minimalist
types of Sect. 3.4 suggests the possibility of extracting a minimalist lexicon
using more or less the same strategy. The last line in Fig. 16 shows a minimalist
lexicon inspired by the categorial lexicon. In the cases where the first argument is
selected to the left, I used a head movement with adjunction solution, indicated
by a selector with an upper case feature. Note that, when we abstract from
the category names, it is quite similar to the minimalist grammars we have seen
before, including the assignment of the licensor =s +m wh to the empty string.
All this suggests that the results from [22,25] can be adapted to the minimalist
case and produce a lexicon which would not be far from one assigned by hand.
However, verifying the size and quality of a lexicon thus extracted would be an
interesting subject requiring a considerable amount of additional research.


5 Conclusion 

We have seen how looking at categorial grammars and minimalist grammars in
a graph theoretic way makes the correspondence between the two systems quite
clear. In addition, it makes the effect of different choices of the translation
function visible in a way which has permitted us to move towards a unification
of different translations. Finally, grammars extracted from corpora turn out to
have a similar shape to translated minimalist grammars, which suggests the
possibility of automatically extracting minimalist grammars from treebanks.
Abstract. Stabler proposes an implementation of the Chomskyan Minimalist
Program [1] with Minimalist Grammars (MG) [2]. This framework
inherits a long linguistic tradition. But the semantic calculus is more easily
added if one uses the Curry-Howard isomorphism. Minimalist Categorial
Grammars (MCG), based on an extension of the Lambek calculus,
the mixed logic, were introduced to provide a theoretically-motivated
syntax-semantics interface [3]. In this article, we give full definitions of
MG with algebraic tree descriptions and of MCG, and take the first steps
towards giving a proof of inclusion of their generated languages.


The Minimalist Program (MP), introduced by Chomsky [1], unified more than
fifty years of linguistic research in a theoretical way. MP postulates that a logical
form and a sound could be derived from syntactic relations. Stabler [2] proposes
a framework for this program in a computational perspective with Minimalist
Grammars (MG). These grammars inherit a long tradition of generative linguistics.
The most interesting contribution of these grammars is certainly that the
derivation system is defined with only two rules: merge and move. The word
Minimalist is introduced in this perspective of simplicity of the definitions of
the framework. While the merge rule is classic for this kind of treatment,
the second rule, move, accounts for the main concepts of this theory and makes
it possible to modify relations between elements in the derived structure.

Even if the phonological calculus is already defined, the logical one is more 
complex to express. Recently, solutions were explored that exploited Curry’s 
distinction between tectogrammatical and phenogrammatical levels; for example,
Lambda Grammars [4], Abstract Categorial Grammars [5], and Convergent
Grammars [6]. First steps for a convergence between the Generative Theory and
Categorial Grammars are due to S. Epstein [7]. A full volume of Language and 
Computation proposes several articles in this perspective [8], in particular [S], 
and Cornell’s works on links between Lambek calculus and Transformational 
Grammars m ■ Formulations of Minimalist Grammars in a Type-Theoretic way 
have also been proposed in mm- These frameworks were evolved in mm 
for the syntax-semantics interface. 

Defining a syntax-semantics interface is complex. In his works, Stabler proposes
to include this treatment directly in MG. But interactions between syntactic
and semantic properties occur at different levels of representation. One solution is
to suppose that these two levels should be synchronized. Then, the Curry-Howard
isomorphism could be invoked to build a logical representation of utterances. The
Minimalist Categorial Grammars have been defined from this perspective: they
capture the same properties as MG and propose a synchronized semantic calculus.
We will propose definitions of these grammars in this article. But do MG and
MCG generate the same language? In this article we take the first steps towards
showing that they do.

The first section proposes new definitions of Minimalist Grammars based on
an algebraic description of trees which allows us to check properties of this
framework [3]. In the second section, we will focus on full definitions of Minimalist
Categorial Grammars (especially the phonological calculus). We will give a short
motivation for the syntax-semantics interface, but the complete presentation is
deferred to a separate article with a complete example. These two parts should be
viewed as a first step of the proof of mutual inclusion of languages between MG
and MCG. This property is important because it enables us to reduce MGs to
MCG, and we have a well-defined syntax-semantics interface for MCG.

1 Minimalist Grammars 

Minimalist Grammars were introduced by Stabler [2] to encode the Minimalist
Program of Chomsky [1]. They capture linguistic relations between constituents
and build trees close to classical Generative Analyses.

These grammars are fully lexicalized, that is to say they are specified by their
lexicon. They are quite different from the traditional definition of lexicalized
because they allow the use of specific items which do not carry any phonological
form. The use of these items implies that MG represent more than syntactic
relations and must be seen as a meta-calculus led by the syntax.

These grammars build trees with two rules, merge and move, which are triggered
by features. This section presents all the definitions of MG in a formal way, using
algebraic descriptions of trees.

1.1 Minimalist Tree Structures 

To provide formal descriptions of Minimalist Grammars, we differ from traditional
definitions by using an algebraic description of trees: a sub-tree is defined
by its context, as in m and m For example, the figure on the left of Fig. |T] 
shows two subtrees in a tree ($t_1$ and $t_2$) and their contexts ($C_1$ and $C_2$). Before
we explain the relations in minimalist trees, we give the formal material used to
define a tree by its context.


Graded Alphabets and Trees. Trees are defined from a graded set. A graded
set is made up of a support set, noted $A$, the alphabet of the tree, and a rank
function, noted $\sigma$, which defines node arity (the graded terminology results from
the rank function). In the following, we will use $\Sigma$ to denote a graded set $(A, \sigma)$.

The set of trees built on $\Sigma$, written $T_\Sigma$, is the smallest set of strings in
$(\Sigma \cup \{(, ), ,\})^*$. A leaf of a tree is a node of arity 0, denoted by $\alpha$ instead of $\alpha()$. For
a tree $t$, if $t = \sigma(t_1, \ldots, t_k)$, the root node of $t$ is written $\sigma$.

Moreover, a set of variables $X = \{x_1, x_2, \ldots\}$ is added for these trees. $X_k$ is
the set of $k$ variables. These variables mark positions in trees. By using variables,
we define a substitution rule: given a tree $t \in T_\Sigma(X_k)$ (i.e. a tree which contains
instances of $k$ variables $x_1, \ldots, x_k$) and $t_1, \ldots, t_k$, $k$ trees in $T_\Sigma$, the tree obtained
by simultaneous substitution of each instance of $x_1$ by $t_1$, ..., $x_k$ by $t_k$ is denoted
by $t[t_1, \ldots, t_k]$. The set of all subtrees of $t$ is noted $S_t$.

Thus, for a given tree $t$ and a given node $n$ of $t$, the subtree for which $n$ is the
root is denoted by its context: $t$ with this subtree replaced by a variable.
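
To make the notions of context and substitution concrete, here is a minimal Python sketch. The encoding is an assumption made for illustration only: trees are nested tuples `(label, child_1, ..., child_n)`, leaves are strings, the class `Var` stands for a variable $x_i$, and positions are given as lists of child indices.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Var:
    index: int  # the variable x_index


def substitute(t, subtrees):
    """Return t[t_1, ..., t_k]: every x_i in t is replaced by subtrees[i - 1]."""
    if isinstance(t, Var):
        return subtrees[t.index - 1]
    if isinstance(t, tuple):  # (label, child_1, ..., child_n)
        return (t[0],) + tuple(substitute(c, subtrees) for c in t[1:])
    return t  # a leaf


def subtree_at(t, path):
    """Subtree of t reached by following the child positions in `path`."""
    for i in path:
        t = t[i + 1]
    return t


def context_of(t, path):
    """Context of the subtree at `path`: t with that subtree replaced by x_1."""
    if not path:
        return Var(1)
    i = path[0]
    return t[: i + 1] + (context_of(t[i + 1], path[1:]),) + t[i + 2 :]


t = ("<", "walks", ("<", "a", "man"))
c = context_of(t, [1, 0])           # ("<", "walks", ("<", Var(1), "man"))
assert substitute(c, [subtree_at(t, [1, 0])]) == t
```

The final assertion checks the defining property of a context: substituting the subtree back into its context gives the original tree.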

Minimalist trees are produced by Minimalist Grammars and they are built on
the graded alphabet $\{<, >, \Sigma\}$, where $<$ and $>$ have rank 2 and strings
of $\Sigma$ have rank 0. Minimalist trees are binary trees whose nodes are labelled with $<$ or $>$,
and whose leaves contain strings of $\Sigma$.


Relations between Sub-Trees. We formalise relations for different positions
of elements in $S_t$. Intuitively, these define the concepts of being above, being on the right
or being on the left. A specific relation on minimalist trees is also defined: projection,
which introduces the concept of being the main element in a tree.

In the following, we assume a given graded alphabet $\Sigma$. Proofs of principal
properties and closure properties are all detailed in [3]. The first relation is
dominance, which informally is the concept of being above.

Definition 1. Let $t \in T_\Sigma$, and $C_1, C_2 \in S_t$; $C_1$ dominates $C_2$ (written
$C_1 \triangleleft^* C_2$) if there exists $C' \in S_t$ such that $C_1[C'] = C_2$.

Figure 1 shows an example of dominance in a tree. One interesting property
of this algebraic description of trees is that properties of sub-trees carry over to the tree.
For example, in a given tree $t$, if there exist $C_1$ and $C_2$ such that $C_1 \triangleleft^* C_2$,
using a 1-context $C$, we could build a new tree $t' = C[t]$ (substitution in the
position marked by a variable). Then, $C[C_1]$ and $C[C_2]$ exist (they are part of
$t'$) such that $C[C_1] \triangleleft^* C[C_2]$.

Definition 2. Let $t \in T_\Sigma$, $C_1, C_2 \in S_t$; $C_1$ immediately precedes $C_2$ (written
$C_1 \prec C_2$) if there exists $C \in S_t$ such that:

1. $C_1 = C[\sigma(t_1, \ldots, t_j, x_1, t_{j+2}, \ldots, t_k)]$ and

2. $C_2 = C[\sigma(t_1, \ldots, t_j, t_{j+1}, x_1, \ldots, t_k)]$.

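Precedence, written $\prec^{+}$, is the smallest relation defined by the following rules
(transitivity rule, closure rule and relation between dominance and precedence
relation):

$$\frac{C_1 \prec^{+} C_2 \quad C_2 \prec^{+} C_3}{C_1 \prec^{+} C_3}\,[trans] \qquad \frac{C_1 \prec C_2}{C_1 \prec^{+} C_2}\,[*] \qquad \frac{C_1 \triangleleft^* C_2}{C_2 \prec^{+} C_1}\,[dom]$$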
[Figure: two trees, each showing the sub-trees $t_1$ and $t_2$ and their contexts $C_1$ and $C_2$]

Left tree (dominance):
- $C_1$ is the context of the sub-tree $t_1$
- $C_2$ is the context of the sub-tree $t_2$
- $C_1 \triangleleft^* C_2$ means that the root node of $t_1$ is higher than the root node of $t_2$ in the full tree

Right tree (precedence):
- $C_1$ is the context of the sub-tree $t_1$
- $C_2$ is the context of the sub-tree $t_2$
- $C_1 \prec^{+} C_2$ means that the root node of $t_1$ is on the left side of the root node of $t_2$ in the tree


Fig. 1. Dominance and precedence relations in trees 


Precedence encodes the relation of being on the left (and hence being on the right) or
being above another element (using dominance). These two relations are preserved
under substitution (as mentioned above).

The next relation does not define a tree relation. It captures a linguistic property
by introducing the concept of being the main element in a structure (or a
substructure).


Definition 3. Let $t \in T_{MG}(A)$, and $C_1, C_2 \in S_t$; $C_1$ immediately projects
on $C_2$ (written $C_1 < C_2$) if there exists $C \in S_t$ such that one of the two following
properties holds:

1. $C_1 = C[<(x_1, t_2)]$ and $C_2 = C[<(t_1, x_1)]$,

2. $C_1 = C[>(t_2, x_1)]$ and $C_2 = C[>(x_1, t_1)]$,

in this case $C \triangleleft C_1$ and $C \triangleleft C_2$. If $C_1 < C_2$ or $C_2 < C_1$, then there exists $C$ such
that $C \triangleleft C_1$ and $C \triangleleft C_2$.

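Projection, written $<^{+}$, is the smallest relation defined by the following system of rules:

$$\frac{C \in S_t}{C <^{+} C}\,[0] \qquad \frac{C_1 <^{+} C_2 \quad C_2 <^{+} C_3}{C_1 <^{+} C_3}\,[trans] \qquad \frac{C_1 < C_2}{C_1 <^{+} C_2}$$

$$\frac{C_1 \triangleleft^* C_2 \quad C_3 \triangleleft^* C_4 \quad C_2 < C_3}{C_1 <^{+} C_4}\,[A] \qquad \frac{C_1 \triangleleft C_2 \quad C_2 < C_3}{C_2 <^{+} C_1}\,[B]$$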
Note that the projection relation is transitive. All the properties of these three
relations are proven in [3]. Figure 2 presents three minimalist trees where in $t$ the
main element is the verb walks (which is accessible by following the projection
relation).

These three relations could seem quite complicated for a reader who is not
familiar with these notations or the zipper theory. But their expressiveness allows
us to prove the structural properties assumed for MG and moreover to give the proof
of language inclusion with MCG. Finally, in this section, we have defined the
concept of parent and child relations in trees plus the projection relation which
defines constituents in linguistic descriptions.

1.2 Linguistic Structures in Trees 

From the linguistic perspective, trees represent relationships between the grammatical
elements of an utterance. Linguistic concepts are associated with minimalist
tree structures. These relationships have been proposed for the analysis of
structural analogies between verbal and nominal groups. Thus, groups of words
in a coherent statement (phrases), whatever their nature, have a similar structure.
This is supposed to be the same for all languages, regardless of the order
of sub-terms. This assumption is one of the basic ideas of the X-bar theory
introduced in the seventies [18] and in the MP [1].

The Head is the element around which a group is composed. An easy way
to find the head of a minimalist tree is to follow the projection relation of the
nodes.

Definition 4. Let $t \in T_{MG}$; if for all $C' \in S_t$, $C <^{+} C'$, then $C$ is called the
head of $t$. For a given tree $t \in T_{MG}$, we write $H_t[x] \in S_t$ for a sub-tree of $t$ of which
$x$ is the head, and $head(t)$ is a leaf which is the head of $t$. Then $t = H_t[head(t)]$.

For a minimalist tree, there always exists a unique minimal element for the 
projection relation and it is a leaf (which is the head of the tree) [3] . 

For example, the head of the minimalist tree in Fig. 2 is the leaf walks (follow
the direction of the projection relation in nodes and stop in a leaf). Subtrees
have their own head: for example, the leaf a is the head of the subtree $t_1$ (in
Fig. 2) and the preposition in is the head of $t_2$.
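
Following the projection arrows can be sketched in a few lines of Python. This is an illustration only: the tuple encoding `(label, left, right)` with string leaves is an assumption of the sketch, and the shape given to the tree of Fig. 2 is reconstructed from the description in the text.

```python
def head(t):
    """Return the head leaf of a minimalist tree.

    Trees are encoded (for this sketch only) as nested tuples
    (label, left, right) with label "<" or ">"; leaves are strings.
    """
    while isinstance(t, tuple):
        label, left, right = t
        # "<" points to the left daughter, ">" to the right one.
        t = left if label == "<" else right
    return t


# Assumed shape of the tree of Fig. 2 ("a man walks in the street").
t1 = ("<", "a", "man")
t2 = ("<", "in", ("<", "the", "street"))
t = (">", t1, ("<", "walks", t2))

print(head(t), head(t1), head(t2))  # walks a in
```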

Maximal Projection is, for a leaf $l$, the largest subtree for which $l$ is the
head. This is the inverse notion of head. In the minimalist tree of Fig. 2, the
maximal projection of the leaf walks is the full tree $t$. To describe other maximal
projections in this example, the maximal projection of a is the subtree which
contains a man and the maximal projection of man is the leaf man. In a
more formal way, the maximal projection is defined as follows:

Definition 5. Let $t \in T_{MG}$, $C \in S_t$. The maximal projection of $C$ (denoted
by $proj_{max}(C)$) is the subtree defined by:
[Figure: tree diagrams of the minimalist tree $t$ and its sub-trees $t_1$ and $t_2$]


Fig. 2. A minimalist tree t and two of its sub-trees


- if $C = x_1$, $proj_{max}(C) = x_1$

- if $C = C'[<(x_1, t)]$ or $C = C'[>(t, x_1)]$, $proj_{max}(C) = proj_{max}(C')$

- if $C = C'[<(t, x_1)]$ or $C = C'[>(x_1, t)]$, $proj_{max}(C) = C$

Then $proj_{max}(walks) = t$. This logical characterization of minimalist trees
and structural relations allows us to prove different properties of MG (for example
that the projection is anti-symmetric) [3].
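
The three cases of Definition 5 can be read as a climb from a position towards the root, which stops as soon as the current node is not the projecting daughter. The Python sketch below illustrates this reading under assumptions of its own: trees are tuples `(label, left, right)` with string leaves, positions are lists of child indices (0 for left, 1 for right), and the tree used is the assumed shape of Fig. 2.

```python
def subtree_at(t, path):
    """Subtree of t reached by following the child indices in `path`."""
    for i in path:
        t = t[i + 1]
    return t


def maximal_projection(t, path):
    """Position of the largest subtree whose head is the node at `path`."""
    path = list(path)
    while path:
        parent = subtree_at(t, path[:-1])
        projecting = 0 if parent[0] == "<" else 1
        if path[-1] != projecting:
            break          # third case: the node does not project further up
        path.pop()         # second case: continue with the mother's context
    return path            # first case when empty: the whole tree


t = (">", ("<", "a", "man"), ("<", "walks", ("<", "in", ("<", "the", "street"))))

print(maximal_projection(t, [1, 0]))  # [] : walks projects to the full tree
print(maximal_projection(t, [0, 0]))  # [0] : a projects to the subtree 'a man'
print(maximal_projection(t, [0, 1]))  # [0, 1] : man projects only to itself
```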

Complement and Specifier are relations on subtrees with respect to the
head.

Elements coming after the head provide information and they are in the
complement relation. Let $t \in T_{MG}$; $C_1$ is a complement of $head(t) = C$ if
$proj_{max}(C) \triangleleft^* C_1$ and $C \prec^{+} C_1$, denoted by $C_1\ comp\ C$.

In the tree $t$ of Fig. 2, the subtree $t_2$ is in a complement relation with the
head walks. It adds information to the verb.

By contrast, elements placed before the head determine who (or what) is in the
relationship. Let $t \in T_{MG}$; $C_1$ is a specifier of $head(t) = C$ if $proj_{max}(C) \triangleleft^* C_1$
and $C_1 \prec^{+} C$, denoted by $C_1\ spec\ C$.

In the tree $t$ of Fig. 2, the subtree $t_1$ is in a specifier relation with the head
walks. It specifies the interpretation of the verb.

1.3 Minimalist Grammars 

The computational system of MG is entirely based on features which represent
linguistic properties of constituents. Rules are triggered by these features and
they build minimalist trees. A Minimalist Grammar is defined by a quintuplet
$(V, Features, Lex, \Phi, c)$ where:

- $V$ is a finite set of non-syntactic features, which contains two sets: $P$ (phonological
forms, marked with / /), and $I$ (logical forms, marked with ( )),

- $Features = B \cup S \cup L_a \cup L_e$ is a finite set of syntactic features,

- $Lex$ is a set of complex expressions from $P$ and $Features$ (lexical items),

- $\Phi = \{merge, move\}$ is the set of generative rules,

- $c \in Features$ is the feature which allows us to accept derivations.
Fig. 3. Automaton of acceptable sequences of features, where $b \in B$ and $d \in D$


The final tree of a derivation which ends with acceptance is called a derivational
tree, which corresponds to a classical generative analysis. Phonological
forms are used as lexical items (and they could be seen as the grammar’s terminal
symbols). A left-to-right reading of phonological forms in derived and
accepted structures provides the recognized string. But intermediate trees in a
derivation do not stand for this. Only the derivational tree allows us to recognize
a string. This results from the move rule which modifies the tree structure. For
a MG $G$, the language $L_G$ recognized by $G$ is the closure of the lexicon by the
generation rules.

1.4 Features 

A MG is defined by its lexicon which stores its resources. Lexical items consist of 
a phonological form and a list of syntactic features. The syntactic set of features 
is divided in two subsets: one for basic categories, denoted B , and one for move 
features, denoted D. Different types of features are: 

- $B = \{v, dp, c, \ldots\}$ the set of basic features. Elements of $B$ denote standard
linguistic categories. Note that this set contains $c$, the accepting feature (I
assume it is unique).

- $S = \{{=}d \mid d \in B\}$ the set of selectors, which express the need for
another feature of $B$ of the same type (for $d \in B$, $=d$ is the dual selector).

- $L_a = \{+k \mid k \in D\}$ the set of licensors. These features assign an expression’s
property to complement another in a specifier-head relation.

- $L_e = \{-k \mid k \in D\}$ the set of licensees. These features state that the
expression needs to be complemented by a similar licensor.

Lexical sequences of features follow the syntax: /FP/ : $(S(S \cup L_a)^*)^* B (L_e)^*$

Vermaat [19] proposes an automaton which recognises the acceptable sequences,
shown in Fig. 3. This structure can be divided into two parts: the first
containing a sequence of selectors and licensors (features which trigger rules, as we
shall see), and the second which contains only one basic feature (the grammatical
category associated with the expression) and a sequence of licensees. The first part
corresponds to states I and II and the second to state III and the transitions to this
state. In the following, e will denote any feature and E a sequence of features
(possibly empty).
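
The syntax of lexical feature sequences can be checked mechanically. The following Python sketch is an illustration and not Vermaat's automaton: it assumes features are written as whitespace-separated tokens, with `=d` for selectors, `+k` for licensors, an ASCII `-k` for licensees and a bare name for basic features; the names `feature_class` and `well_formed` are hypothetical.

```python
import re


def feature_class(f):
    """Map a feature token to a one-letter class."""
    if f.startswith("="):
        return "S"  # selector
    if f.startswith("+"):
        return "A"  # licensor (L_a)
    if f.startswith("-"):
        return "E"  # licensee (L_e)
    return "B"      # basic feature


# (S (S ∪ L_a)*)* B (L_e)* over the one-letter classes.
PATTERN = re.compile(r"(S[SA]*)*BE*")


def well_formed(features):
    classes = "".join(feature_class(f) for f in features.split())
    return PATTERN.fullmatch(classes) is not None


print(well_formed("=d +case v"))   # True
print(well_formed("=n d -case"))   # True
print(well_formed("+case =d v"))   # False: a licensor cannot come first
```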

For example, the sequence associated with an intransitive verb will be: 


=d +case v 
which means that this verb must be joined with a determiner phrase (a term which
comes from the Generative Theory), a complex expression with feature d. Then
it must be combined with a -case (we will see how in the next section), and then
there is a structure associated with the verb (feature v).

Transitive verbs extend the intransitive ones with the list:

=d +case =d +case v

The two =d correspond to the subject and the object of the verb. The first case
will be accusative and the second nominative.

Another example is determiners: they are combined with a noun to build a 
determiner phrase and need to be unified in the structure (see the next section). 
Here is an example of lexicon which contains a verb, a noun and a determiner: 

walks : =d +case v
a : =n d -case
man : n


1.5 MG Rules 

$\Phi$, the set of generating rules, contains only merge and move. A derivation is a
succession of rule applications which build trees. These trees are partial results:
the structural order of phonological forms does not need to correspond to the
final one. In the MP, a specific point, called Spell-Out, is the border between
the calculus of derivations and the final result. Rules are triggered by the feature
occurring as the first element of the list of features of the head.


Merge is the process which connects different parts. It is an operation which
joins two trees to build a new one:

$$merge : T_{MG} \times T_{MG} \to T_{MG}$$

It is triggered by a selector ($=x$) at the top of the list of features of the head
and it is realised with a corresponding basic feature ($x$) at the top of the list of
features of the head of a second tree. Merge adds a new root which dominates
both trees and cancels the two features. The specifier/complement relation is
implied by the lexical status of the tree which carried the selector. The new root
node points to this tree.

Let $t, t' \in T_{MG}$ be such that $t = H_t[l : {=}h\ E]$ and $t' = H_{t'}[l' : h\ E']$ with
$h \in B$:

$$merge(t, t') = \begin{cases} <(l : E,\ H_{t'}[l' : E']) & \text{if } t \in Lex, \\ >(H_{t'}[l' : E'],\ H_t[l : E]) & \text{otherwise.} \end{cases}$$

Figure 4 presents the graphical representation of merge.
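
As a hedged illustration of this definition, merge can be sketched in Python. The classes `Leaf` and `Node` and the helpers `head` and `drop_head_feature` are assumptions of the sketch; in particular, the condition $t \in Lex$ is approximated by testing whether the selecting tree is a single leaf.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Leaf:
    form: str
    feats: tuple = ()


@dataclass(frozen=True)
class Node:
    label: str      # "<" or ">", pointing towards the projecting daughter
    left: object
    right: object


def head(t):
    while isinstance(t, Node):
        t = t.left if t.label == "<" else t.right
    return t


def drop_head_feature(t):
    """Copy of t whose head leaf has lost its first feature."""
    if isinstance(t, Leaf):
        return Leaf(t.form, t.feats[1:])
    if t.label == "<":
        return Node("<", drop_head_feature(t.left), t.right)
    return Node(">", t.left, drop_head_feature(t.right))


def merge(t, t2):
    selector, category = head(t).feats[0], head(t2).feats[0]
    if selector != "=" + category:
        raise ValueError("merge is not triggered")
    lexical = isinstance(t, Leaf)   # stands in for the test "t in Lex"
    t, t2 = drop_head_feature(t), drop_head_feature(t2)
    # A lexical item takes the selected tree on its right; a derived tree
    # takes it on its left. The new root points to the selecting tree.
    return Node("<", t, t2) if lexical else Node(">", t2, t)


a = Leaf("a", ("=n", "d", "-case"))
man = Leaf("man", ("n",))
walks = Leaf("walks", ("=d", "+case", "v"))
vp = merge(walks, merge(a, man))
```

With the three-item lexicon given earlier, `vp` is the tree whose head walks still carries +case v and in which a still carries -case.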

For example, to derive a man walks , we first need to combine a with man, 
and then to combine the result with the verb: 





Fig. 4. Tree representation of merge and move. 


In term notation, showing only the features that remain after cancellation:

    <(a : d -case, man)    and    <(walks : +case v, <(a : -case, man))

The trees obtained do not yet respect the word order (only the final tree will have
the right word order). In this example, the selectors are carried by lexical items,
so the projection relations point to the left in both cases.


Move encodes the main idea of the Minimalist Program. It corresponds to the
movement of a constituent to the top position of the derivation. Move is triggered
by a licensor ($+x$) at the top of the list of features of the head of a tree. Then,
it looks for a corresponding licensee ($-x$) at the top of the list of features of
the head inside the tree. If these conditions are met, the maximal projection of
the node which carries the licensee is moved to the left of a new root. This node
points to the right (the subtree which carries the former head). Both licensor and
licensee are cancelled. The root of the moved maximal projection is substituted
by an empty leaf ($\epsilon$). This new leaf is called the trace of the move.

Figure 4 shows a graphical representation of the move rule where the head
of $C$ carries a $+g$ at the top of its features list. Then we look for a leaf with $-g$ at
the top of its features list and find its maximal projection ($C_2$), which contains
all the elements which depend on it. Finally this sub-tree is moved to the left
position of a new root node. Intuitively, a linguistic property is checked and the
consequence is a move to first position in the tree. Formally:

$$move : T_{MG} \to T_{MG}$$

For every tree $t = C[l : +g\ E,\ l' : -g\ E']$ such that $t = H_t[l : +g\ E]$, there exist
$C_1, C_2 \in S_t$ such that $C_2$ is the maximal projection of the leaf $l'$ and $C_1$ is $t$
deprived of $C_2$. Then, $t = C_1[l : +g\ E,\ C_2[l' : -g\ E']]$ where:

- $C_2[l' : -g\ E'] = proj_{max}(C[l' : -g\ E'])$

- $C_1[l : +g\ E, x_1] = proj_{max}(C[l : +g\ E, x_1])$

$$move(t) = >(C_2[l' : E'],\ C_1[l : E, \epsilon])$$

Figure 4 presents the graphical representation of move.
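
The move rule can be sketched in the same illustrative style. Everything specific to the code is an assumption: the classes `Leaf` and `Node`, positions given as lists of 0 (left) and 1 (right), the ASCII `-` for licensees, and the empty leaf `Leaf("")` standing for the trace $\epsilon$. The sketch only covers the strong move defined above and refuses to apply when the licensee is not unique.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Leaf:
    form: str
    feats: tuple = ()


@dataclass(frozen=True)
class Node:
    label: str      # "<" or ">", pointing towards the projecting daughter
    left: object
    right: object


def get(t, path):
    for side in path:
        t = t.left if side == 0 else t.right
    return t


def put(t, path, new):
    """Copy of t with the subtree at `path` replaced by `new`."""
    if not path:
        return new
    if path[0] == 0:
        return Node(t.label, put(t.left, path[1:], new), t.right)
    return Node(t.label, t.left, put(t.right, path[1:], new))


def head_path(t):
    path = []
    while isinstance(t, Node):
        side = 0 if t.label == "<" else 1
        path.append(side)
        t = get(t, [side])
    return path


def leaf_paths(t, path=()):
    if isinstance(t, Leaf):
        yield list(path)
    else:
        yield from leaf_paths(t.left, path + (0,))
        yield from leaf_paths(t.right, path + (1,))


def drop_head_feature(t):
    hp = head_path(t)
    leaf = get(t, hp)
    return put(t, hp, Leaf(leaf.form, leaf.feats[1:]))


def move(t):
    licensor = get(t, head_path(t)).feats[0]
    if not licensor.startswith("+"):
        raise ValueError("move is not triggered")
    licensee = "-" + licensor[1:]
    hits = [p for p in leaf_paths(t) if get(t, p).feats[:1] == (licensee,)]
    if len(hits) != 1:
        raise ValueError("no licensee, or several candidate licensees")
    p = hits[0]
    # Climb to the maximal projection of the licensee leaf.
    while p and p[-1] == (0 if get(t, p[:-1]).label == "<" else 1):
        p = p[:-1]
    moved = drop_head_feature(get(t, p))
    rest = drop_head_feature(put(t, p, Leaf("")))   # Leaf("") is the trace
    return Node(">", moved, rest)


vp = Node("<", Leaf("walks", ("+case", "v")),
          Node("<", Leaf("a", ("-case",)), Leaf("man")))
result = move(vp)
```

Here `result` has the moved constituent a man on the left of the new root, while the former head walks keeps only the feature v and its complement position holds the trace.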

Stabler introduces some refinements to these grammars. Let us mention them.
He introduces a second move, weak move, which does not move the phonological
forms. The preceding move is then called strong move, which is triggered with
capital features. The weak move is defined like the strong move:

$$move(t) = >(C_2[\epsilon : E'],\ C_1[l : E, l'])$$

Variations on strong/weak values achieve variations on phonological order. 
This is an instance of the use of parameters of the Minimalist Program. 

Moreover, restrictions can be introduced on MG derivations. An important
one is the Shortest Move Condition (SMC), which blocks move in case of ambiguity
on licensees. Then, the move operation of MG with SMC is deterministic.

A locality condition could also be introduced: the Specifier Island Condition (SPIC).
“Islands” define areas which prohibit extractions. With SPIC, a subtree cannot
be moved if it is in a specifier relation within a subtree. This condition was
introduced by Stabler [20], drawing on [21] and [22], who propose that moved
elements have to be in a complement relation.

In the previous example, the head of the last tree is the leaf walks which
contains a +case feature as first element of its list. Then, a move is triggered
in the tree with the leaf a which carries a -case. The resulting tree, showing
only the features that remain after cancellation, is the following:

    >(<(a, man), <(walks : v, ε))


The move operation modifies the position of the maximal projection of the
leaf which carries the -case. The old position is substituted by an empty leaf
(ε). Finally, the tree contains only one feature, which is v. In this small example,
I did not discuss the validity of the final feature, but in a real derivation, we
assume that it is not the verb which carries the +case licensor which corresponds
to the nominative case, but a specific item. This item corresponds to the
morphological mark of the verb. Then each acceptable derivation assumes that
a verb has received its tense (and other properties). But exhibiting the use of
this item needs other refinements of the two rules (Head-movement and
Affix-Hopping).
This section did not propose a new framework for computational linguistics.
Rather, it gave a new definition of Stabler’s proposal. In this way, assumed properties
of minimalist trees have been fully proved [3]. Moreover, this algebraic definition of MG
is well suited to comparing generated languages with other frameworks.
Finally, it modifies the point of view on derivations and shows all steps of the
calculus as substitutions. One missing point is still the introduction of a semantic
calculus. Let us now develop MCG, which are defined with a syntax-semantics
interface.


2 Minimalist Categorial Grammars (MCG) 

In this section, we define a new type-theoretic framework which is provided
by the mixed calculus, a formulation of Partially Commutative Linear Logic. It
is designed to simulate MG and thus to keep the linguistic properties of the Minimalist
Program. MCG are motivated by the syntax-semantics interface [3]. This interface,
as for the Lambek calculus, is based on an extension of the Curry-Howard
isomorphism [23]. Even though this interface is not the aim of this paper, let us
discuss some important points.

The idea of encoding MP with Lambek calculus arises from El and ex¬ 
tended versions of this work. In these propositions, the calculus is always non- 
commutative, a property needed to model the left-right relation in sentences. But 
the move operation could not be defined in a proper way with non-commutative 
relation. In particular, in complex utterances, the non-commutativity implies 
that a constituent (for example the object DP) must be fully treated before 
another one is introduced (for example the subject DP). Otherwise, features 
are mixed and non-commutativity blocks resolutions. It is not acceptable to 
normalize the framework with such a strong property and it makes the system 
inconsistent in regard to linguistics. 

The solution we propose is to define a new framework which deals
with both commutative and non-commutative connectors: the mixed calculus. The
main consequence of this calculus for the model is that variables in logical formulae
are introduced at different places and must be unified later. In [3] we
show how this unification is used to capture semantic phenomena which are not
easily included otherwise. In a few words, the idea is to consider proofs of the mixed
calculus as phases of a verb. Phases were introduced by Chomsky to detail the different
modifications which occur on a verb. Several linguists have shown that phases
have implications for semantics; for example, the theta-roles must be allocated
after a specific phase. This is exactly the result of the syntax-semantics interface
of MCG. Full explanations need more space to be presented, but the main
contribution of MCG is to propose an efficient syntax-semantics interface in the
same perspective as MG.

In this section, we will detail MCG and expose their structural link with MG. 
First we present the mixed calculus, then we give definitions of MCG and show 
proofs of the mixed calculus produced by MCG (together with their linguistic 
properties). 
2.1 Mixed Calculus

MCG are based on the mixed calculus [24], a formulation of Partially Commutative
Linear Logic. Hypotheses are either in a non-commutative order ($\langle\,;\rangle$) or
in a commutative one ($(\,,)$). The plain calculus contains introduction and elimination
rules for:


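— the non-commutative product $\odot$:

\[
\frac{\Delta \vdash A \odot B \qquad \Gamma, \langle A; B \rangle, \Gamma' \vdash C}{\Gamma, \Delta, \Gamma' \vdash C}\,[\odot_e]
\qquad
\frac{\Delta \vdash A \qquad \Gamma \vdash B}{\langle \Delta; \Gamma \rangle \vdash A \odot B}\,[\odot_i]
\]

— its residuals ($/$ and $\backslash$):

\[
\frac{\Gamma \vdash A \qquad \Delta \vdash A \backslash C}{\langle \Gamma; \Delta \rangle \vdash C}\,[\backslash_e]
\qquad
\frac{\langle A; \Gamma \rangle \vdash C}{\Gamma \vdash A \backslash C}\,[\backslash_i]
\]

\[
\frac{\Delta \vdash C / A \qquad \Gamma \vdash A}{\langle \Delta; \Gamma \rangle \vdash C}\,[/_e]
\qquad
\frac{\langle \Gamma; A \rangle \vdash C}{\Gamma \vdash C / A}\,[/_i]
\]

— the commutative product $\otimes$:

\[
\frac{\Delta \vdash A \otimes B \qquad \Gamma, (A, B), \Gamma' \vdash C}{\Gamma, \Delta, \Gamma' \vdash C}\,[\otimes_e]
\qquad
\frac{\Delta \vdash A \qquad \Gamma \vdash B}{(\Delta, \Gamma) \vdash A \otimes B}\,[\otimes_i]
\]

— its residual $\multimap$:

\[
\frac{\Gamma \vdash A \qquad \Delta \vdash A \multimap C}{(\Gamma, \Delta) \vdash C}\,[\multimap_e]
\qquad
\frac{(A, \Gamma) \vdash C}{\Gamma \vdash A \multimap C}\,[\multimap_i]
\]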


The product connectors of the mixed calculus first use hypotheses to
mark positions in the proof, and then substitute the result of another
proof in these positions using a product elimination (the commutative or
non-commutative status depends on the relations between hypotheses). This is exactly
the process we will use to define the move rule of MCG.

Moreover, the calculus contains an axiom rule and an entropy rule. The latter
allows the order between hypotheses to be relaxed. We will use this rule to define
merge in MCG, as we will see in the following section.


\[
\frac{}{A \vdash A}\,[\text{axiom}]
\qquad
\frac{\Gamma \vdash C}{\Gamma' \vdash C}\,[\text{entropy, whenever } \Gamma' \sqsubset \Gamma]
\]


This calculus has been shown to be normalizable [23] and derivations of MCG 
will be proofs of the mixed calculus in normal form. 


2.2 Minimalist Categorial Grammars 

Like MG, MCG are lexicalized grammars. Derivations are led by formulae associated
with lexical items and built with connectors of the mixed logic. They are specific
proofs of the mixed logic, labelled to realise the phonological and semantic tiers.
Phonological labels on proofs will be presented with the definitions of the MCG rules.
An MCG is defined by a quintuplet $\langle N, P, Lex, \Phi, C \rangle$ where:

— $N$ is the union of two finite disjoint sets $Ph$ and $I$, which are respectively the
set of phonological forms and the set of logical forms.

— $P$ is the union of two finite disjoint sets $P_1$ and $P_2$, which are respectively the
set of constituent features (the set B of MG) and the set of move features
(the set D of MG).

— $Lex$ is a finite subset of $E \times F \times I$, the set of lexical items¹.

— $\Phi = \{merge, move\}$ is the set of generative rules.

— $C \in P$ is the accepting formula.

As mentioned in the previous section, move is defined using a product elimination.
In MG, a constituent is first introduced in a tree using its basic feature
and can then be moved using its licensees. In MCG, a constituent is introduced
only when all its positions (which correspond to the basic feature and its
licensees) have been marked in the proof by specific hypotheses. But we need
to distinguish the type of the basic feature from that of the licensee features. That is
why $P$ is divided into two subsets $P_1$ and $P_2$. This sub-typing of formulae is used
to define the lexicons of MCG properly.

The set $E$ is $Ph^*$, and the set $F$, the set of formulae used to build $Lex$, is
defined with the set $P$, the commutative product $\otimes$ and the two non-commutative
implications $/$ and $\backslash$. Formulae of $F$ are recognized by the non-terminal L of the
following grammar:


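\[
\begin{array}{rcl}
L & ::= & (B) / P_1 \mid C \\
B & ::= & P_1 \backslash (B) \mid P_2 \backslash (B) \mid C \\
C & ::= & P_2 \otimes (C) \mid C_1 \\
C_1 & ::= & P_1
\end{array}
\]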

In more detail, MCG formulae start with a $/$, which is followed by a sequence
of $\backslash$. This sequence contains operators allowing the proof to be composed
with another one (these operators are the translation of selectors and licensors). Lexical
formulae end with a sequence of $\otimes$. To sum up, these formulae have the
structure $(c_m \backslash \ldots \backslash c_1 \backslash (b_1 \otimes \ldots \otimes b_n \otimes a)) / d$, with $a \in P_1$, $b_i \in P_2$, $c_j \in P$ and
$d \in P_1$. This structure corresponds to the two parts of the list of features we
have mentioned in the previous section.

For the example a man walks, the MCG’s lexicon is the following: 

walks : case\v/d 
a : (case ⊗ d)/n
man : n 
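
The shape of these lexical formulae can be checked mechanically. The following is a minimal Python sketch, not part of the original framework: the record `LexFormula`, the functions `is_well_formed` and `show`, and the feature sets `P1` and `P2` are hypothetical names, and the two sets are only assumed from the example lexicon above.

```python
from dataclasses import dataclass
from typing import FrozenSet, Optional, Tuple


@dataclass(frozen=True)
class LexFormula:
    """Hypothetical record for (c_m \\ ... \\ c_1 \\ (b_1 ⊗ ... ⊗ b_n ⊗ a)) / d."""
    right_arg: Optional[str]      # d, expected in P1; None when the formula has no '/'
    left_args: Tuple[str, ...]    # c_m, ..., c_1 (outermost first), each in P1 or P2
    licensees: Tuple[str, ...]    # b_1, ..., b_n, each in P2
    base: str                     # a, in P1


def is_well_formed(f: LexFormula, p1: FrozenSet[str], p2: FrozenSet[str]) -> bool:
    """Check the shape imposed by the non-terminal L of the grammar."""
    if f.base not in p1 or any(b not in p2 for b in f.licensees):
        return False
    if any(c not in (p1 | p2) for c in f.left_args):
        return False
    if f.right_arg is None:
        # L ::= C: without '/', no '\' is allowed either.
        return not f.left_args
    return f.right_arg in p1


def show(f: LexFormula) -> str:
    """Print the formula in the notation used for the lexicon."""
    core = " ⊗ ".join(f.licensees + (f.base,))
    if f.licensees and (f.left_args or f.right_arg is not None):
        core = "(" + core + ")"
    body = "".join(c + "\\" for c in f.left_args) + core
    return body if f.right_arg is None else body + "/" + f.right_arg


P1 = frozenset({"d", "n", "v"})   # constituent features (assumed from the example)
P2 = frozenset({"case"})          # move features (assumed from the example)

LEXICON = {
    "walks": LexFormula("d", ("case",), (), "v"),
    "a": LexFormula("n", (), ("case",), "d"),
    "man": LexFormula(None, (), (), "n"),
}
```

Under these assumptions, the three entries print back as the formulae of the lexicon, and a formula with a `\` but no `/` is rejected, as the grammar requires.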

Licensees, which express the need for some information, are seen here as a
specific part of the basic feature (a part of the main sub-type). Licensors will
be cancelled with a hypothesis to mark a position in the proof. The distinction
between them is not written with an ad hoc marker but through structural relations
inside the formula. Before we explain the move and merge rules, let us present
the phonological tier.

¹ In the following, $Lex$ is a subset of $E \times F$. The semantic part is used for the
syntax-semantics interface, which is not detailed here.
2.3 Derivations

Labels. Derivations of MCG are labelled proofs of the mixed calculus. Before 
defining labelling, we define labels and operations on them. 

Let $V$ be a countably infinite set of variables such that $Ph \cap V = \emptyset$. $T$ is
the union of $Ph$ and $V$. We define the set $\Sigma$, called the set of labels, as the set of triplets
of elements of $T^*$. Every position in a triplet has a linguistic interpretation: the positions
correspond to the specifier/head/complement relations of minimalist trees. A label
$r$ will be written $r = (r_{spec}, r_{head}, r_{comp})$.

For a label in which there is an empty position, we adopt the following
notation: $r_{-head} = (r_{spec}, \epsilon, r_{comp})$, $r_{-spec} = (\epsilon, r_{head}, r_{comp})$ and
$r_{-comp} = (r_{spec}, r_{head}, \epsilon)$. We introduce variables in the string triplets and a substitution
operation. They are used to replace a position inside a triplet by a specific
material. Intuitively, this is the counterpart in the phonological calculus of the
product elimination. The set of variables with at least one occurrence in $r$ is denoted by
$Var(r)$. The number of occurrences of a variable $x$ in a string $s \in T^*$ is denoted
by $|s|_x$, and the number of occurrences of $x$ in $r$ by $\varphi_x(r)$. A label is linear if for
all $x$ in $V$, $\varphi_x(r) \leq 1$.

A substitution is a partial function from $V$ to $T^*$. For $\sigma$ a substitution, $s$ a
string of $T^*$ and $r$ a label, we write $s.\sigma$ and $r.\sigma$ for the string and the label obtained by
the simultaneous substitution in $s$ and $r$ of the variables by the values associated
by $\sigma$ (variables for which $\sigma$ is not defined remain the same).

If the domain of definition of a substitution $\sigma$ is finite and equal to $\{x_1, \ldots, x_n\}$
and $\sigma(x_i) = t_i$, then $\sigma$ is denoted by $[t_1/x_1, \ldots, t_n/x_n]$. Moreover, for a string
$s$ and a label $r$, $s.\sigma$ and $r.\sigma$ are respectively denoted $s[t_1/x_1, \ldots, t_n/x_n]$ and
$r[t_1/x_1, \ldots, t_n/x_n]$. Every injective substitution which takes values in $V$ is called a
renaming. Two labels $r_1$ and $r_2$ (respectively two strings $s_1$ and $s_2$) are equal
modulo a renaming of variables if there exists a renaming $\sigma$ such that $r_1.\sigma = r_2$
(resp. $s_1.\sigma = s_2$).

Finally, we need another operation on string triplets which allows them to be
combined: the string concatenation of $T^*$ is written $\bullet$. Let $Concat$ be the
operation on labels which concatenates the three components
in the linear order: for $r \in \Sigma$, $Concat(r) = r_{spec} \bullet r_{head} \bullet r_{comp}$.

We have thus defined a phonological structure which encodes specifier/complement/head
relations and two operations: concatenation and substitution. These
two operations will be the counterparts in the phonological calculus of merge and
move, respectively.
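
To make these definitions concrete, here is a minimal Python sketch of labels, simultaneous substitution and Concat. The encoding is an assumption made for illustration only: strings of $T^*$ are tuples of symbols, a label is a triple of such tuples, and the function names are hypothetical.

```python
from typing import Dict, Iterable, Tuple

String = Tuple[str, ...]                 # a string of T*: words and variables
Label = Tuple[String, String, String]    # (spec, head, comp)


def concat(r: Label) -> String:
    """Concat(r): spec, head and comp concatenated in the linear order."""
    spec, head, comp = r
    return spec + head + comp


def subst_string(s: String, sigma: Dict[str, String]) -> String:
    """s.sigma: every symbol in the domain of sigma is replaced in one pass."""
    out: String = ()
    for sym in s:
        out += sigma.get(sym, (sym,))    # symbols outside the domain stay as they are
    return out


def subst_label(r: Label, sigma: Dict[str, String]) -> Label:
    """r.sigma: the substitution is applied to the three components."""
    spec, head, comp = r
    return (subst_string(spec, sigma), subst_string(head, sigma), subst_string(comp, sigma))


def occurrences(r: Label, x: str) -> int:
    """Number of occurrences of the variable x in the label r."""
    return sum(part.count(x) for part in r)


def is_linear(r: Label, variables: Iterable[str]) -> bool:
    """A label is linear if no variable occurs more than once in it."""
    return all(occurrences(r, x) <= 1 for x in variables)
```

Because each symbol is looked up once in a single pass, the values substituted for one variable are never rewritten by the binding of another variable, which is what "simultaneous" requires.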

Labelled Proofs. Before exhibiting the rules of MCG, we introduce the concept of labelling
on a subset of the rules of the mixed logic. Minimalist logic is the
fragment of mixed logic composed of the axiom rule, $\backslash_e$, $/_e$, $\otimes_e$ and $\sqsubset$.

For a given MCG $G = \langle N, P, Lex, \Phi, C \rangle$, let a G-background be $x : A$ with
$x \in V$ and $A \in F$, or $\langle G_1; G_2 \rangle$, or else $(G_1, G_2)$ with $G_1$ and $G_2$ some
G-backgrounds which are defined on two disjoint sets of variables. G-backgrounds
are series-parallel orders on subsets of $V \times F$. They are naturally extended to the
entropy rule, written $\sqsubset$. A G-sequent is a sequent of the form $\Gamma \vdash_G (r_s, r_t, r_c) : B$
where $\Gamma$ is a G-background, $B \in F$ and $(r_s, r_t, r_c) \in \Sigma$.

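A G-labelling is a derivation of a G-sequent obtained with the following rules:

\[
\frac{(s, A) \in Lex}{\vdash_G (\epsilon, s, \epsilon) : A}\,[Lex]
\qquad
\frac{x \in V}{x : A \vdash_G (\epsilon, x, \epsilon) : A}\,[axiom]
\]

\[
\frac{\Gamma \vdash_G r_1 : A / B \qquad \Delta \vdash_G r_2 : B \qquad Var(r_1) \cap Var(r_2) = \emptyset}{\langle \Gamma; \Delta \rangle \vdash_G (r_{1s}, r_{1t}, r_{1c} \bullet Concat(r_2)) : A}\,[/_e]
\]

\[
\frac{\Delta \vdash_G r_2 : B \qquad \Gamma \vdash_G r_1 : B \backslash A \qquad Var(r_1) \cap Var(r_2) = \emptyset}{\langle \Delta; \Gamma \rangle \vdash_G (Concat(r_2) \bullet r_{1s}, r_{1t}, r_{1c}) : A}\,[\backslash_e]
\]

\[
\frac{\Gamma \vdash_G r_1 : A \otimes B \qquad \Delta[x : A, y : B] \vdash_G r_2 : C \qquad Var(r_1) \cap Var(r_2) = \emptyset \qquad A \in P_2}{\Delta[\Gamma] \vdash_G r_2[Concat(r_1)/x, \epsilon/y] : C}\,[\otimes_e]
\]

\[
\frac{\Gamma \vdash_G r : A \qquad \Gamma' \sqsubset \Gamma}{\Gamma' \vdash_G r : A}\,[\sqsubset]
\]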


Note that a G-labelling is a proof tree of the minimalist logic in which sequent
hypotheses are decorated with variables and sequent conclusions are decorated
with labels. Product elimination is performed with a substitution on labels, and implication
connectors with concatenation (a triplet is introduced into another one by
concatenating its three components).

If $\Gamma \vdash_G r : B$ is a derivable G-sequent, then $r$ is linear, and $Var(r)$ is exactly
the set of variables in $\Gamma$. Finally, for all renamings $\sigma$, $\Gamma.\sigma \vdash_G r.\sigma : B$ is a
derivable G-sequent.


Merge and Move Rules. Merge and move are simulated by combinations of rules of the
minimalist logic producing G-labellings.

Merge is the elimination of $/$ (resp. $\backslash$) immediately followed by an entropy
rule. The meaning of this rule is to join two elements with regard to the left-right
order (hence non-commutative connectors are used) and, as mentioned earlier,
all hypotheses must remain accessible. To ensure this, a commutative order between
hypotheses is needed. An entropy rule therefore immediately follows each implication
elimination.

For the phonological tier, a label is concatenated in the complement (respectively
specifier) position of another one. Note that a merge which uses $/$ must
be realized with a lexical item, so the context is always empty.
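\[
\frac{\dfrac{\vdash (r_{spec}, r_{head}, r_{comp}) : A / B \qquad \Delta \vdash s : B}{\Delta \vdash (r_{spec}, r_{head}, r_{comp} \bullet Concat(s)) : A}\,[/_e]}{\Delta \vdash (r_{spec}, r_{head}, r_{comp} \bullet Concat(s)) : A}\,[\sqsubset]
\]

\[
\frac{\dfrac{\Delta \vdash s : B \qquad \Gamma \vdash (r_{spec}, r_{head}, r_{comp}) : B \backslash A}{\langle \Delta; \Gamma \rangle \vdash (Concat(s) \bullet r_{spec}, r_{head}, r_{comp}) : A}\,[\backslash_e]}{\Delta, \Gamma \vdash (Concat(s) \bullet r_{spec}, r_{head}, r_{comp}) : A}\,[\sqsubset]
\]

These combinations of rules are noted [mg].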

For example, the proof of the utterance a man walks begins with the formula
of walks: case\v/d. The first step of the calculus is to introduce two hypotheses,
one for d and the other for case. The result is the following proof:


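\[
\frac{v : case \vdash (\epsilon, v, \epsilon) : case \qquad \dfrac{\vdash (\epsilon, walks, \epsilon) : case \backslash v / d \qquad u : d \vdash (\epsilon, u, \epsilon) : d}{u : d \vdash (\epsilon, walks, u) : case \backslash v}\,[mg]}{(v : case, u : d) \vdash (v, walks, u) : v}\,[mg]
\]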

In parallel, the derivation joins the determiner a and the noun man: 


\[
\frac{\vdash (\epsilon, a, \epsilon) : (case \otimes d) / n \qquad \vdash (\epsilon, man, \epsilon) : n}{\vdash (\epsilon, a, man) : case \otimes d}\,[mg]
\]

Note that the first proof contains two hypotheses which correspond to the 
type of the main formula in the second proof. The link between these two proofs 
will be made by a move, as we will show later. 

Move is simulated by an elimination of a commutative product in a proof 
and, for the phonological calculus, is a substitution. We have structured the 
lexicons and the merge rule to delay to the move rule only the substitution part 
of the calculus. 


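\[
\frac{\Gamma \vdash r_1 : A \otimes B \qquad \Delta[u : A, v : B] \vdash r_2 : C}{\Delta[\Gamma] \vdash r_2[Concat(r_1)/u, \epsilon/v] : C}
\]

This rule is applied only if $A \in P_2$ and $B$ is of the form $B_1 \otimes \ldots \otimes B_n \otimes D$ where
$B_i \in P_2$ and $D \in P_1$.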

This rule is noted [mv]. Move uses hypotheses as resources. The calculus places
hypotheses in the proof, and when all hypotheses corresponding to a constituent
have been introduced, this constituent is substituted. The hypothesis of $P_1$ is the first
place of a moved constituent, and the hypotheses of $P_2$ mark the different places
to which the constituent is moved or where it leaves a trace.

In recent proposals, Chomsky suggests delaying all moves until after the realisation
of all merges. MCG cannot encode this but, contrary to MG where a
move blocks the whole process, in MCG merge can still happen, except when
hypotheses of a given constituent are shared by two proofs which must be linked by
a move.

In our example, we have two proofs:

— one for the verb: $(v : case, u : d) \vdash (v, walks, u) : v$

— one for the DP: $\vdash (\epsilon, a, man) : case \otimes d$

The first hypothesis corresponds to the entry position of the DP in MG and the 
second to the moved position. Here, we directly introduce the DP by eliminating 
the two hypotheses in the same step: 


\[
\frac{\vdash (\epsilon, a, man) : case \otimes d \qquad (v : case, u : d) \vdash (v, walks, u) : v}{\vdash (a\ man, walks, \epsilon) : v}\,[mv]
\]


The phonological result is a man walks. The proof encodes the same structure 
as the derivational tree of MG (modulo a small transduction on the proof). 
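
The phonological side of this small derivation can be replayed in a few lines. The sketch below is only an illustration under stated assumptions: it tracks labels and ignores formulae and contexts entirely, labels are triples of tuples of symbols, and the helper names (`lex`, `hyp`, `merge_right`, `merge_left`, `move`) are hypothetical.

```python
EPS = ()   # the empty string ε


def lex(word):
    """Label of a lexical item: (ε, word, ε)."""
    return (EPS, (word,), EPS)


def hyp(x):
    """Label of a hypothesis: (ε, x, ε), with x a variable."""
    return (EPS, (x,), EPS)


def concat(r):
    """Concat(r): spec, head and comp in the linear order."""
    return r[0] + r[1] + r[2]


def merge_right(r, s):
    """Phonological part of [mg] with '/': s goes to the complement of r."""
    return (r[0], r[1], r[2] + concat(s))


def merge_left(s, r):
    """Phonological part of [mg] with '\\': s goes to the specifier of r."""
    return (concat(s) + r[0], r[1], r[2])


def move(r1, r2, landing, trace):
    """Phonological part of [mv]: Concat(r1) replaces the variable `landing`,
    and ε replaces the variable `trace`, simultaneously."""
    sigma = {landing: concat(r1), trace: EPS}
    return tuple(
        tuple(out for sym in part for out in sigma.get(sym, (sym,)))
        for part in r2
    )


# Verb proof: walks first takes the hypothesis u (for d) as complement,
# then the hypothesis v (for case) as specifier.
verb = merge_left(hyp("v"), merge_right(lex("walks"), hyp("u")))

# DP proof: the determiner takes the noun as complement.
dp = merge_right(lex("a"), lex("man"))

# Move: the DP lands on the case hypothesis, the d hypothesis is emptied.
sentence = move(dp, verb, landing="v", trace="u")
```

With these definitions, `" ".join(concat(sentence))` yields the string a man walks. The type side of the rules (the side conditions on $P_1$ and $P_2$, and the contexts) is deliberately left out of this sketch.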

For cyclic move (where a constituent is moved several times), all hypotheses
inside this move must be linked together upon their introduction in the proof. For
this, when a new hypothesis $A$ is introduced, a [mv] is applied with a sequent with
hypothesis $A \otimes B \vdash A \otimes B$, where $A$ is in $P_2$ and $B$ is of the form
$B_1 \otimes \ldots \otimes B_n \otimes D$ where $B_i \in P_2$ and $D \in P_1$.

\[
\frac{x : A \otimes B \vdash (\epsilon, x, \epsilon) : A \otimes B \qquad \Delta[u : A, v : B] \vdash r : C}{\Delta[A \otimes B] \vdash r[x/u, \epsilon/v] : C}\,[\otimes_e]
\]


In the definition of merge, the systematic use of entropy comes from the definition
of move. As presented above, move consumes hypotheses of the proof.
But, from a linguistic perspective, these hypotheses cannot be assumed to be
introduced next to each other. The non-commutative order inferred from $\backslash_e$ and
$/_e$ blocks the application of move. To avoid this, the entropy rule places them in
commutative order. In MCG, all hypotheses are in the same relation, so, to
simplify the reading of proofs, the order is denoted only with ','.

The strong/weak move can be simulated through the localization of the substitution
(depending on whether hypotheses are in $P_1$ or $P_2$).


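\[
\frac{s : \Gamma \vdash A \otimes B \qquad r[u, v] : \Delta[u : A, v : B] \vdash C}{r[Concat(s)/u, \epsilon/v] : \Delta[\Gamma] \vdash C}\,[move_{strong}]
\]

\[
\frac{s : \Gamma \vdash A \otimes B \qquad r[u, v] : \Delta[u : A, v : B] \vdash C}{r[\epsilon/u, Concat(s)/v] : \Delta[\Gamma] \vdash C}\,[move_{weak}]
\]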


This version of move is quite different from the one presented for MG, but is
close to the one developed for later MG such as [25].

The main difference between MG and MCG comes from move: in MCG,
constituents do not move but use hypotheses marking their places. MCG use the
commutativity properties of mixed logic and see hypotheses as resources. To sum
up, the derivation rules of MCG are the following:
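\[
\frac{(s, A) \in Lex}{\vdash_G (\epsilon, s, \epsilon) : A}\,[Lex]
\]

\[
\frac{\vdash (r_{spec}, r_{head}, r_{comp}) : A / B \qquad \Delta \vdash s : B}{\Delta \vdash (r_{spec}, r_{head}, r_{comp} \bullet Concat(s)) : A}\,[mg]
\]

\[
\frac{\Delta \vdash s : B \qquad \Gamma \vdash (r_{spec}, r_{head}, r_{comp}) : B \backslash A}{\Delta, \Gamma \vdash (Concat(s) \bullet r_{spec}, r_{head}, r_{comp}) : A}\,[mg]
\]

\[
\frac{\Gamma \vdash r_1 : A \otimes B \qquad \Delta[u : A, v : B] \vdash r_2 : C}{\Delta[\Gamma] \vdash r_2[Concat(r_1)/u, \epsilon/v] : C}\,[mv]
\]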


The set $D_G$ of derivations recognized by an MCG $G$ is the set of proofs obtained
with this set of rules and for which the concluding sequent is $\vdash r : C$. The
language generated by $G$ is $L(G) = \{Concat(r) \mid \vdash r : C \in D_G\}$.

These derivations do not formally conserve the projection relation (nor the 
specifier, head and complement relations). These principles are reintroduced with 
strings. However, the head of a proof could be seen as the principal formula of 
mixed logic, and then by extension, the maximal projection is the proof for which 
a formula is the principal one. Specifier and complement are only elements on 
the right or left of this formula. 

An interesting remark is that the rules of MCG do not use the introduction rules
of the mixed calculus. They only combine formulae
extracted from a lexicon with hypotheses. As in MG, where a derivation cancels
features, the MCG system only consumes hypotheses and always reduces the
size of the main formula (only the size of the context can increase). This corresponds
to the cognitive fact that we stress the system in the analysis perspective.
Introduction rules can be seen as captured by the given lexicon. But, because
of the strong structure of the items, we directly associate formulae and strings.

We have presented all the MCG rules and lexicon, and illustrated them with 
a tiny example which encodes the main properties of this framework. 

3 Conclusion 

In this article, we propose new definitions of MG based on an algebraic description
of trees. These definitions make it possible to check properties of this framework and
moreover give a formal basis for analysing links with other frameworks. Then,
we give the definitions of MCG, a type-theoretic framework for MG. In this
framework, merge and move are simulated by rules of the mixed logic (an extension
of Lambek calculus to product and non-commutative connectors). The
phonological calculus is added by labelling proofs of this logic.

The main contribution of MCG is certainly its syntax-semantics interface. 
This calculus is synchronized on proofs of MCG. But more technical details are 
needed to present this interface and the linguistic properties which it encodes. 
We delay the presentation of this interface to a future presentation. 

Finally, the syntax-semantics interface of MCG should be used only under the
condition that MCG keep the properties of MG. This is the aim of another future article,
which will present the proof of inclusion of MG generated languages in MCG
generated languages. To prove this property, two alternative representations of
MG and MCG derivations are introduced: alternative derived structures and
split proofs, with the corresponding merge and move. These structures and rules
bridge the gap between the two kinds of derivations. They need technical details
and more space to be presented.

Definitions and proofs can easily be extended to refinements of merge, Affix-Hopping
and Head-Movement, because these operations derive the same strings
in both structures. But we have not included these rules in this presentation. On
the other hand, the proof of inclusion mentioned here does not include the SMC.
The interpretation of SMC in MCG must be better defined before being included
in such a perspective. The generative power of these grammars with the shortest move
condition is still open.

This article is a first step to several perspectives which make a strong link 
between a well defined framework with many linguistic properties and a new one 
which captures this framework and proposes a syntax-semantics interface. 
Abstract. In this paper, we aim at understanding the derivations of
minimalist grammars without the shortest move constraint. This leads
us to study the relationship of those derivations with logic. In particular,
we show that the membership problem of minimalist grammars without
the shortest move constraint is as difficult as provability in Multiplicative
Exponential Linear Logic. As a byproduct, this result gives us a
new representation of those derivations with linear λ-terms. We show
how to interpret those terms in a homomorphic way so as to recover
the sentence they analyse. As the homomorphisms we describe are rather
involved, we turn to a proof-net representation and explain how Monadic
Second Order Logic and related techniques allow us both to define those
proof-nets and to retrieve the sentence they analyse.

Since Stabler defined Minimalist Grammars¹ (MGs) as a mathematical account
of Chomsky’s minimalist program, an important research effort has been dedicated
to giving a logical account of them. MGs use a feature checking system
which guides the derivations and, as those features behave similarly to resources,
it seemed possible to represent those derivations in some substructural logic.
There have been many proposals (among others [1], [2], [3], [4], [5]), but
we do not think that any of them establishes a relation between MGs and logic
in a satisfactory way. These proposals, in most cases, describe a way of
building proofs in a certain logic so as to describe minimalist derivations. But
they cannot be considered as a logical account of minimalist derivations, since
they use extra, non-logical constraints that rule out proofs that would not
represent a minimalist derivation. Those proposals nevertheless solve a
problem that is inherent to Stabler’s formalism. Indeed, in Stabler’s formalism,
the derivations are ambiguous in the sense that they can be interpreted into
different sentences that have different meanings. Thus, when dealing with semantic
interpretations, one needs to interpret derivations both syntactically and
semantically so as to build the syntax/semantics relation.

In the present paper, we give a logical account of minimalist grammars as 
proofs in Multiplicative Exponential Linear Logic (MELL) [6]. We claim that this 
account is accurate and of logical nature for two reasons; first, because we prove 
that the membership problem for minimalist grammars is Turing-equivalent to 
provability in MELL; second, we define minimalist derivations as being all the 

¹ Across the paper, unless stated otherwise, when we refer to Minimalist Grammars,
we are referring to Stabler’s Minimalist Grammars without the Shortest Move
Constraint.


S. Pogodalla, M. Quatrini, and C. Retoré (Eds.): Lecomte Festschrift, LNAI 6700, pp. 81-117, 2011.
© Springer-Verlag Berlin Heidelberg 2011
proofs of a particular sequent (in the article we actually give the equivalent
presentation in terms of closed linear λ-terms of a certain type). Nevertheless,
even though linear logic is dealing with resources, in our approach, linear logic is
not modeling the fact that features are treated as resources in MGs. While this
is somehow a defect of our approach, it shows that MGs’ derivations are dealing
with other kinds of resources that we call moving pieces, but which correspond
to the linguistic notion of traces.

The idea that has motivated this work is not so widespread in
the community of computational linguistics. It consists in making a clear distinction
between derivations and surface realisations. This idea can be traced back to
Curry [7], is very common in compiling, and has been recently reintroduced
in computational linguistics by the works of Muskens [8] and de Groote [9]. So we
start thinking about Minimalist Grammars only from the point of view of their
derivations, trying to find a good representation and to understand how to build
them. We continue by studying how to retrieve the surface form, or the string,
that the derivation is analyzing. This step is harder than one would expect, but
it also shows an interesting feature. Indeed, we have not been able to find a way
to interpret our derivations without the use of a context which is quite similar
to the context that de Groote proposes for semantics [10]. Finally, so as to
find a more satisfactory way of reading sentences out of derivations, we turn to
Formal Language Theory and use techniques related to Monadic Second Order
Logic (MSO). This leads us to a fairly simple account of both the structure of
derivations and the way of interpreting them.

The paper is organized as follows. In Sect. 1 we introduce the linear λ-calculus
and minimalist grammars. We show that the languages defined by minimalist
grammars are closed under intersection with recognizable sets. This allows us to
prove the Turing-equivalence of the emptiness problem and of the membership
problem for MGs. In Sect. 2 we show that the emptiness problem for MGs is
Turing-equivalent to provability in MELL. We proceed in two steps. First, we
show that the emptiness problem for a particular class of automata, k-VATA,
can be reduced to the emptiness problem for MGs. As the emptiness problem for
k-VATA is as difficult as provability in MELL, this reduces provability in MELL
to the emptiness of MGs. Second, we show an encoding of minimalist derivations
as linear λ-terms and we study some consequences of that encoding. Section 3
shows how the representation of minimalist derivations as linear λ-terms can
be interpreted into sentences and the limitations of this interpretation. Then
Sect. 4 tries to overcome those limitations with Monadic Second Order Logic.
Section 5 gives some conclusions on this work.

1 Preliminaries 

In this section we introduce two technical notions: the linear λ-calculus and minimalist
grammars. The λ-calculus was introduced so as to define a theory
of functions. But it captures the notion of binding and has therefore been extensively
used in the formal semantics of natural languages. For the syntax of natural
languages, the linear λ-calculus can be seen as a representation of the deductions of
type logical grammars via the Curry-Howard isomorphism. A more explicit and
systematic use of the linear λ-calculus in syntax is proposed by the Abstract
Categorial Grammars [9] and the λ-grammars [8]. The interest of the linear λ-calculus
in modeling syntax is that it naturally extends the notion of syntactic tree by
providing it with the possibility of representing traces with linear λ-abstraction.
So even though there seems a priori to be very little relationship between minimalist
grammars and the linear λ-calculus, they can at least be related by the fact
that traces occupy a central position in minimalist grammars and that the linear
λ-calculus offers the possibility to represent traces.

1.1 Linear λ-Calculus

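We now present the linear λ-calculus. Linear types are built from a given finite
set of atoms $A$ by using the infix operator $\multimap$. The set of types built from $A$,
$\mathcal{T}_{\multimap}(A)$, is constructed according to the following grammar:

\[
\mathcal{T}_{\multimap}(A) ::= A \mid (\mathcal{T}_{\multimap}(A) \multimap \mathcal{T}_{\multimap}(A))
\]

We adopt the convention that $\multimap$ associates to the right and that
$\alpha_1 \multimap \cdots \multimap \alpha_n \multimap \beta$ represents the type
$(\alpha_1 \multimap \cdots (\alpha_n \multimap \beta) \cdots)$. As usual, $ord(\alpha)$, the order
of a simple type $\alpha$ of $\mathcal{T}_{\multimap}(A)$, is defined to be 1 when $\alpha$ is atomic (i.e. $\alpha$ is an
element of $A$), and $\max(ord(\alpha_1) + 1, ord(\alpha_2))$ when $\alpha = \alpha_1 \multimap \alpha_2$.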
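
The definition of the order of a type translates directly into a recursive function. The following Python sketch is only an illustration; the class names `Atom` and `Arrow` and the helper `arrows` are hypothetical.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Atom:
    """An atomic type, i.e. an element of the set of atoms A."""
    name: str


@dataclass(frozen=True)
class Arrow:
    """A linear function type src -o tgt."""
    src: "Type"
    tgt: "Type"


Type = Union[Atom, Arrow]


def arrows(*types: Type) -> Type:
    """Build a_1 -o ... -o a_n -o b, associating to the right."""
    *args, result = types
    for a in reversed(args):
        result = Arrow(a, result)
    return result


def order(t: Type) -> int:
    """ord(t): 1 for an atom, max(ord(src) + 1, ord(tgt)) for an arrow."""
    if isinstance(t, Atom):
        return 1
    return max(order(t.src) + 1, order(t.tgt))
```

For instance, with `o = Atom("o")`, the type `arrows(o, o, o)` has order 2, while `Arrow(Arrow(o, o), o)`, whose argument is itself a function type, has order 3.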

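A higher order signature $\Sigma$ is a triple $(A, C, \tau)$ where $A$ is a finite set of
atoms, $C$ is a finite set of constants and $\tau$ is a function from $C$ to $\mathcal{T}_{\multimap}(A)$. A
signature is said to be of $n$-th order if $\max_{c \in C}(ord(\tau(c))) \leq n$. We use a type system à
la Church, which means that variables explicitly carry their types. We adopt the
notation $x^{\alpha}$ to specify that $x$ is a variable of type $\alpha$. The family
$(\Lambda^{\alpha}_{\Sigma})_{\alpha \in \mathcal{T}_{\multimap}(A)}$ is defined by:

1. $c \in \Lambda^{\tau(c)}_{\Sigma}$ when $c \in C$,

2. $x^{\alpha} \in \Lambda^{\alpha}_{\Sigma}$,

3. $(t_1 t_2) \in \Lambda^{\beta}_{\Sigma}$ if $t_1 \in \Lambda^{\alpha \multimap \beta}_{\Sigma}$, $t_2 \in \Lambda^{\alpha}_{\Sigma}$ and $FV(t_1) \cap FV(t_2) = \emptyset$,

4. $\lambda x^{\alpha}.t \in \Lambda^{\alpha \multimap \beta}_{\Sigma}$ if $t \in \Lambda^{\beta}_{\Sigma}$ and $x^{\alpha} \in FV(t)$.

where $FV(t)$ is the set of free variables (defined as usual) of $t$. The λ-terms
that are in $\Lambda^{\alpha}_{\Sigma}$ are said to be linear, because a variable may have at most one free
occurrence in a term and because every bound variable has exactly one free
occurrence below the λ that binds it.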
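
The linearity conditions of clauses 3 and 4 can be checked by a single traversal that computes free variables. The sketch below makes simplifying assumptions that are not in the text: terms are untyped, variables are identified by name only, and the class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Var:
    name: str


@dataclass(frozen=True)
class Const:
    name: str


@dataclass(frozen=True)
class App:
    fun: object
    arg: object


@dataclass(frozen=True)
class Lam:
    var: str
    body: object


def free_vars_if_linear(t):
    """Return the free variables of t if t is linear, and None otherwise."""
    if isinstance(t, Const):
        return frozenset()
    if isinstance(t, Var):
        return frozenset({t.name})
    if isinstance(t, App):
        f = free_vars_if_linear(t.fun)
        a = free_vars_if_linear(t.arg)
        # Clause 3: both parts are linear and share no free variable.
        if f is None or a is None or f & a:
            return None
        return f | a
    if isinstance(t, Lam):
        b = free_vars_if_linear(t.body)
        # Clause 4: the bound variable must occur free in the body.
        if b is None or t.var not in b:
            return None
        return b - {t.var}
    raise TypeError("not a term")


def is_linear(t):
    return free_vars_if_linear(t) is not None
```

The disjointness test in the application case guarantees that a variable never occurs twice, and the membership test in the abstraction case rules out vacuous binders, so a bound variable ends up with exactly one occurrence.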

When they are not relevant or when they can easily be inferred from the
context, we will omit the typing annotations on the variables. We also use the
convention that $t_0 t_1 \ldots t_n$ denotes the term $(\cdots (t_0 t_1) \cdots t_n)$ and that $\lambda x_1 \ldots x_n.t$
denotes the term $\lambda x_1. \ldots \lambda x_n.t$. We take for granted the notions of α-conversion,
β-contraction and η-contraction. We always consider λ-terms up to α-convertibility,
and we respectively write (with $\gamma \in \{\beta; \eta; \beta\eta\}$) $\rightarrow_{\gamma}$, $\rightarrow^{*}_{\gamma}$, $=_{\gamma}$ for the relations
of γ-contraction, γ-reduction and γ-conversion. A term is closed when its set of
free variables is empty.
Contexts are λ-terms with a hole, which are written $C[]$. The operation of
grafting a term $N$ in the hole of a context $C[]$ is written $C[N]$. For example, if
$C[] = \lambda x.[]$ and $N = x$ then $C[N] = \lambda x.x$.

The linear λ-calculus is a conservative extension of the notion of ranked trees.
A signature $\Sigma = (A, C, \tau)$ is said to be a tree signature when $A = \{o\}$ and
for all $c \in C$, $\tau(c)$ is of the form $o \multimap \cdots \multimap o \multimap o$. We will in general write
$o^n \multimap o$ for the type $o \multimap \cdots \multimap o \multimap o$ with $n$ argument occurrences of $o$
(when $n = 0$, $o^n \multimap o$ simply denotes
$o$). Trees are then denoted in the obvious way by closed linear λ-terms of type
$o$ in normal form. Tree signatures may also be called ranked alphabets. We may
denote with $\Sigma^{(n)}$ the set of constants declared in $\Sigma$ which have type $o^n \multimap o$.
If $\Sigma_1 = (\{o\}, C_1, \tau_1)$ and $\Sigma_2 = (\{o\}, C_2, \tau_2)$ are two ranked alphabets such that
$C_1 \cap C_2 = \emptyset$, we write $\Sigma_1 \cup \Sigma_2$ to refer to the ranked alphabet
$(\{o\}, C_1 \cup C_2, \tau_1 \cup \tau_2)$. A multi-sorted tree signature or a multi-sorted ranked alphabet is simply a
second order signature. When we deal with ranked trees, we assume that they
are represented by linear λ-terms in normal form and we represent a subtree of
a tree $t$ as a pair $(C[], v)$ such that $C[v] = t$.

1.2 Minimalist Grammars 

A minimalist grammar G is a tuple $(V, B, F, C, c)$ where V is a finite set of
words, B is a finite set of selection features, F is a finite set of licensing
features, C is a lexicon (a finite set of lexical entries that are defined
below) and $c \in B$. Features are used in two different forms, a positive form
and a negative form. The positive form of a selection feature b (resp.
licensing feature f) is denoted by $=b$ (resp. $+f$) while its negative form is
denoted by $b$ (resp. $-f$). The set of positive (resp. negative) features of G
will be denoted by $B^+$ and $F^+$ (resp. $B^-$ and $F^-$).

The elements of C, the lexical entries, are pairs $(v, l)$ where
$v \in V \cup \{\epsilon\}$ and l is a string built using symbols taken from
$B^- \cup B^+ \cup F^- \cup F^+$. These strings are not arbitrary: they have a
special structure and are of the form $l_1 a l_2$ where:

1. $a \in B^-$,

2. $l_1$ is a string (possibly empty) of elements taken from $B^+ \cup F^+$
which must start with an element of $B^+$ when it is not empty,

3. $l_2$ is a string (possibly empty) of elements taken only from $F^-$.

The set of feature suffixes of G is the set
$\mathrm{Suff}(G) = \{l_2 \mid \exists (v, l_1 l_2) \in C\}$, the set of moving
suffixes of G is the set
$\mathrm{Move}(G) = \{l \in \mathrm{Suff}(G) \mid l \in (F^-)^*\}$. A lexical
construction is a pair $(w, l)$ such that $w \in V^*$ and
$l \in \mathrm{Suff}(G)$; a moving piece is a lexical construction $(w, l)$ such
that $l \in \mathrm{Move}(G)$.
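The shape constraint on feature strings and the two suffix sets can be sketched in a few lines of Python. This is only an illustration: the encoding of features as strings such as "=d", "+case", "d" and "-case", and the names split_entry, suffixes and moving_suffixes, are assumptions of the sketch, not notation from the text. The lexicon used is the one of Example 1 below.

```python
def split_entry(features):
    """Split a feature string l into (l1, a, l2), or return None if ill-formed."""
    i = 0
    while i < len(features) and features[i][0] in "=+":
        i += 1                               # skip the positive prefix l1
    if i == len(features) or features[i][0] == "-":
        return None                          # no negative selection feature a
    l1, a, l2 = features[:i], features[i], features[i + 1:]
    if l1 and not l1[0].startswith("="):
        return None                          # l1 must start with an element of B+
    if any(not f.startswith("-") for f in l2):
        return None                          # l2 contains only elements of F-
    return l1, a, l2

def suffixes(lexicon):
    """Suff(G): every suffix l2 of a feature string l1 l2 of the lexicon."""
    return {l[i:] for _, l in lexicon for i in range(len(l) + 1)}

def moving_suffixes(lexicon):
    """Move(G): the suffixes made of negative licensing features only."""
    return {l for l in suffixes(lexicon) if all(f.startswith("-") for f in l)}

EXAMPLE_LEXICON = {
    ("Maria", ("d", "-case")),
    ("speak", ("=d", "=d", "v")),
    ("will", ("=v", "+case", "c")),
    ("Nahuatl", ("d",)),
}

print(moving_suffixes(EXAMPLE_LEXICON))
```

Following the definition literally, the empty string counts as a suffix, so it belongs to both sets in this sketch.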

The derivations of a minimalist grammar G are defined on a tree signature of
the form $\mathrm{Der}(G) = (\{o\}, \{\mathrm{merge}; \mathrm{move}\} \cup C, \rho)$
where $\rho(\mathrm{merge}) = o \multimap o \multimap o$,
$\rho(\mathrm{move}) = o \multimap o$ and $\rho(c) = o$ when $c \in C$. The set
of trees that can be built on Der(G) will be written d(G).

In order to produce the strings that derivations are representing, we use
a transformation $\mathcal{H}$ that interprets the elements of d(G) as pairs
$(s, L)$ where:

1. s is a lexical construction, the head of the derivation, and,

2. L is a finite multiset of moving pieces.

We consider multisets built on a set A as functions from A to $\mathbb{N}$.
Such a multiset L is said to be finite when $\sum_{a \in A} L(a)$ is finite.
Given a and a multiset L we say that a has $L(a)$ occurrences in L. We will
confuse a and the multiset $L_a$ which contains one occurrence of a and no
occurrence of elements different from a. We write $\emptyset$ to denote the
multiset which contains no occurrence of any element. Given two multisets
$L_1$ and $L_2$ we write $L_1 \cup L_2$ for the multiset such that
$(L_1 \cup L_2)(a) = L_1(a) + L_2(a)$. We may represent finite multisets L with
a list notation $[e_1, \ldots, e_n]$, with the understanding that for each a
there are exactly $L(a)$ elements $e_i$ that are equal to a. The fact that we
use multisets of moving pieces is a first hint for understanding the relation
between MGs and MELL. Indeed, contexts of hypotheses in MELL are best
represented as multisets of formulae.
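As a small illustration of these conventions, finite multisets can be modelled with Python's collections.Counter, whose addition adds multiplicities exactly as in the definition of the union above. The moving pieces used here, in particular the piece ("Juan", "-wh"), are hypothetical and only serve the example.

```python
from collections import Counter

# A moving piece is modelled as a (word, features) pair; this encoding is an
# assumption made for illustration.
L1 = Counter({("Maria", "-case"): 1})
L2 = Counter({("Maria", "-case"): 1, ("Juan", "-wh"): 2})

# Multiset union adds multiplicities: (L1 ∪ L2)(a) = L1(a) + L2(a).
union = L1 + L2

# The empty multiset contains no occurrence of any element.
empty = Counter()

def as_list(multiset):
    """List notation [e1, ..., en]: each a appears exactly L(a) times."""
    return sorted(multiset.elements())

print(union[("Maria", "-case")])   # number of occurrences of that piece
print(as_list(union))
```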

The transformation $\mathcal{H}$ is defined as follows:

1. $\mathcal{H}(\mathrm{merge}\ t_1\ t_2) = ((w_2, l_2), (w_1, l_1) \cup L_1 \cup L_2)$
if $\mathcal{H}(t_1) = ((w_1, a l_1), L_1)$, $l_1$ is not empty and
$\mathcal{H}(t_2) = ((w_2, {=}a l_2), L_2)$,

2. $\mathcal{H}(\mathrm{merge}\ t_1\ t_2) = ((w_1 w_2, l_2), L_1 \cup L_2)$ if
$\mathcal{H}(t_1) = ((w_1, a), L_1)$, $\mathcal{H}(t_2) = ((w_2, {=}a l_2), L_2)$
and $t_2$ is not an element of C,

3. $\mathcal{H}(\mathrm{merge}\ t_1\ t_2) = ((w_2 w_1, l_2), L_1 \cup L_2)$ if
$\mathcal{H}(t_1) = ((w_1, a), L_1)$, $\mathcal{H}(t_2) = ((w_2, {=}a l_2), L_2)$
and $t_2$ is an element of C,

4. if $\mathcal{H}(t_1) = ((w, {+}a l), (w', {-}a l') \cup L)$ then

$$\mathcal{H}(\mathrm{move}\ t_1) = \begin{cases} ((w, l), (w', l') \cup L) & \text{when } l' \text{ is not empty} \\ ((w' w, l), L) & \text{otherwise} \end{cases}$$

5. in the other cases $\mathcal{H}(t)$ is undefined.

In this way G defines two languages:

1. the language of its derivations
$\mathcal{D}(G) = \{t \mid \mathcal{H}(t) = ((w, c), \emptyset)\}$,

2. the string language
$\mathcal{L}(G) = \{w \in V^* \mid \exists t.\ \mathcal{H}(t) = ((w, c), \emptyset)\}$

Example 1. In the course of this paper, we will use an example adapted from 
m with a grammar using the following lexical entries: 

(Maria, d -case), (speak, =d =d v), (will, =v +case c), (Nahuatl, d)


With this grammar we can give an analysis of the sentence Maria will speak 
Nahuatl with the derivation t that is represented by the following term: 
                     move
                      |
                    merge
                   /     \
              merge       (will, =v +case c)
             /     \
(Maria, d -case)    merge
                   /     \
       (Nahuatl, d)       (speak, =d =d v)


We here give the details of the computation of $\mathcal{H}(t)$:

1. let $u_1$ = merge (Nahuatl, d) (speak, =d =d v) then (case 3 of the
definition)

$\mathcal{H}(u_1)$ = ((speak Nahuatl, =d v), $\emptyset$)

2. let now $u_2$ = merge (Maria, d -case) $u_1$; we have that (case 1 of the
definition)

$\mathcal{H}(u_2)$ = ((speak Nahuatl, v), [(Maria, -case)])

3. let $u_3$ = merge $u_2$ (will, =v +case c) and then (case 3 of the
definition, since (will, =v +case c) is an element of C)

$\mathcal{H}(u_3)$ = ((will speak Nahuatl, +case c), [(Maria, -case)])

4. finally, with t = move $u_3$ (case 4 of the definition),
$\mathcal{H}(t)$ = ((Maria will speak Nahuatl, c), $\emptyset$)

An element t from d(G) is said to satisfy the Shortest Move Constraint (SMC)
if $\mathcal{H}(t)$ is defined and is of the form $(s, L)$ where for each
licensing feature f of G there is at most one occurrence in L of a moving piece
of the form $(w, {-}f l)$. A term t is said to hereditarily satisfy the SMC when
t and each of its subterms satisfy the SMC (we write that t is HSMC).
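The SMC condition on a multiset of moving pieces can be sketched as follows. The encoding (pieces as (word, features) pairs with features given as a tuple of strings, multisets as Counter objects) and the name satisfies_smc are assumptions of this sketch. The second multiset is the one that arises in the course of Example 2 below, where two pieces start with the same feature -c.

```python
from collections import Counter

def satisfies_smc(moving_pieces):
    """Check the SMC on a finite multiset of moving pieces.

    Each piece is (word, features), where features is a non-empty tuple of
    negative licensing features such as ("-c", "-c"); this encoding is an
    assumption of the sketch.
    """
    first_features = Counter()
    for (word, features), count in moving_pieces.items():
        first_features[features[0]] += count
    # At most one moving piece of the form (w, -f l) for each feature f.
    return all(n <= 1 for n in first_features.values())

ok = Counter({("Maria", ("-case",)): 1})
clash = Counter({("γ1 δ", ("-c", "-c")): 1, ("γ2", ("-c",)): 1})

print(satisfies_smc(ok), satisfies_smc(clash))
```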

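With the SMC, G defines two languages:

1. the language of its SMC-derivations
$\mathcal{D}_{SMC}(G) = \{t \mid \mathcal{H}(t) = ((w, c), \emptyset)$ and t is HSMC$\}$,

2. the string SMC-language
$\mathcal{L}_{SMC}(G) = \{w \in V^* \mid \exists t \in \mathcal{D}_{SMC}(G).\ \mathcal{H}(t) = ((w, c), \emptyset)\}$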

In the general case (with derivations that do not satisfy the SMC), the
mapping $\mathcal{H}$ cannot be seen as a homomorphism. Indeed, the
interpretations of merge or move via $\mathcal{H}$ lead to functions which have
to inspect their arguments in order to possibly compute a result. Moreover, the
interpretation of move is not deterministic, since one can pick any element in
the multiset of moving pieces which exhibits the required feature. There would
be an easy way of turning $\mathcal{H}$ into a homomorphism, simply by:

- distinguishing the domains in which elements of C and complex expressions
are interpreted by $\mathcal{H}$,

- interpreting the term t as the set of pairs $(s, L)$ in which $\mathcal{H}$
can interpret t.

But this technique does not help to grasp any interesting criterion, apart
from the actual computation, that allows one to understand in which cases
$\mathcal{H}(t)$ gives an actual result. Thus this presentation of minimalist
grammars does not give a satisfactory account of the mathematical nature of the
derivations and can be seen as an algorithm that computes derived structures.
Another reason why this technique of turning $\mathcal{H}$ into a homomorphism
is not worthwhile is that in general minimalist grammars are not only concerned
with syntax but also with the interface between syntax and semantics. When
$\mathcal{H}(t)$ outputs several results, these results should in general be
put in relation with different semantic representations. Therefore, derivation
terms do not completely determine the relation between syntax and semantics:
this relation really depends on the actual computation $\mathcal{H}$ is doing
on the terms. Understanding the mathematical nature of the derivations of
minimalist grammars should therefore lead to the definition of derivations on
which it would be possible both to check easily whether they denote a correct
syntactic object and to relate this syntactic object uniquely to some semantic
representation.

Example 2. We here give an artificial example of an ambiguous derivation. We 
use a grammar with the following lexical entries: 

(α, =a₁ =a₂ b), (β, =b +c +c +c b), (γ₁, =d a₁ -c -c), (γ₂, a₂ -c), (δ, d)


and we build the derivation t represented by the term: 

                  move
                   |
                  move
                   |
                  move
                   |
                 merge
                /     \
           merge       (β, =b +c +c +c b)
          /     \
(γ₂, a₂ -c)      merge
                /     \
           merge       (α, =a₁ =a₂ b)
          /     \
     (δ, d)      (γ₁, =d a₁ -c -c)

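We can now compute the possible values of $\mathcal{H}(t)$:

1. let $u_1 = \mathrm{merge}\,(\mathrm{merge}\,(\delta, d)\,(\gamma_1, {=}d\,a_1\,{-}c\,{-}c))\,(\alpha, {=}a_1\,{=}a_2\,b)$
then $\mathcal{H}(u_1) = ((\alpha, {=}a_2\,b), [(\gamma_1\delta, {-}c\,{-}c)])$,

2. let $u_2 = \mathrm{merge}\,(\mathrm{merge}\,(\gamma_2, a_2\,{-}c)\,u_1)\,(\beta, {=}b\,{+}c\,{+}c\,{+}c\,b)$,
we easily obtain that

$\mathcal{H}(u_2) = ((\beta\alpha, {+}c\,{+}c\,{+}c\,b), [(\gamma_1\delta, {-}c\,{-}c), (\gamma_2, {-}c)])$

3. let $u_3 = \mathrm{move}(u_2)$ then we have two possible results for
$\mathcal{H}(u_3)$:

$((\beta\alpha, {+}c\,{+}c\,b), [(\gamma_1\delta, {-}c), (\gamma_2, {-}c)])$ and
$((\gamma_2\beta\alpha, {+}c\,{+}c\,b), [(\gamma_1\delta, {-}c\,{-}c)])$

4. let $u_4 = \mathrm{move}(u_3)$, there are two possible results for
$\mathcal{H}(u_4)$:

$((\gamma_1\delta\beta\alpha, {+}c\,b), [(\gamma_2, {-}c)])$ and
$((\gamma_2\beta\alpha, {+}c\,b), [(\gamma_1\delta, {-}c)])$

5. finally there are two possible results for $\mathcal{H}(t)$:

$((\gamma_2\gamma_1\delta\beta\alpha, b), \emptyset)$ and
$((\gamma_1\delta\gamma_2\beta\alpha, b), \emptyset)$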
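The computations of Examples 1 and 2 can be replayed with a small set-valued interpreter, in the spirit of the remark above that a term may be interpreted as the set of pairs (s, L) it can denote. This is a sketch under stated assumptions, not the paper's definition: derivation terms are nested tuples, features are strings such as "=d" or "-c", multisets are sorted tuples, a lexical entry is taken to denote itself with an empty multiset of moving pieces (the definition above leaves this case implicit), and all function names are hypothetical.

```python
from itertools import product

def lex(word, *features):
    """A lexical entry (word, features); the empty word is written ""."""
    return ("lex", (word,) if word else (), tuple(features))

def merge(t1, t2):
    return ("merge", t1, t2)

def move(t1):
    return ("move", t1)

def multiset(pieces):
    """Canonical (sorted tuple) representation of a finite multiset."""
    return tuple(sorted(pieces))

def interpret(t):
    """Set of all pairs (head, moving pieces) that can be assigned to t."""
    if t[0] == "lex":
        # Assumption: a lexical entry denotes itself with no moving piece.
        return {((t[1], t[2]), ())}
    results = set()
    if t[0] == "merge":
        _, t1, t2 = t
        for ((w1, l1), L1), ((w2, l2), L2) in product(interpret(t1), interpret(t2)):
            if not l1 or not l2 or l2[0] != "=" + l1[0]:
                continue                      # case 5: undefined
            if len(l1) > 1:                   # case 1: t1 becomes a moving piece
                results.add(((w2, l2[1:]), multiset(L1 + L2 + ((w1, l1[1:]),))))
            elif t2[0] != "lex":              # case 2: t2 is not lexical
                results.add(((w1 + w2, l2[1:]), multiset(L1 + L2)))
            else:                             # case 3: t2 is lexical
                results.add(((w2 + w1, l2[1:]), multiset(L1 + L2)))
    elif t[0] == "move":
        for (w, l), L in interpret(t[1]):
            if not l or not l[0].startswith("+"):
                continue
            for i, (w2, l2) in enumerate(L):
                if l2[0] != "-" + l[0][1:]:
                    continue
                rest = L[:i] + L[i + 1:]
                if len(l2) > 1:               # case 4: the piece keeps moving
                    results.add(((w, l[1:]), multiset(rest + ((w2, l2[1:]),))))
                else:                         # case 4: the piece lands
                    results.add(((w2 + w, l[1:]), rest))
    return results

def strings(t, final):
    """Strings of the complete results: head feature `final`, no moving piece."""
    return {" ".join(w) for (w, l), L in interpret(t) if l == (final,) and not L}

example1 = move(merge(
    merge(lex("Maria", "d", "-case"),
          merge(lex("Nahuatl", "d"), lex("speak", "=d", "=d", "v"))),
    lex("will", "=v", "+case", "c")))

u1 = merge(merge(lex("δ", "d"), lex("γ1", "=d", "a1", "-c", "-c")),
           lex("α", "=a1", "=a2", "b"))
u2 = merge(merge(lex("γ2", "a2", "-c"), u1),
           lex("β", "=b", "+c", "+c", "+c", "b"))
example2 = move(move(move(u2)))

print(strings(example1, "c"))
print(strings(example2, "b"))
```

The choice of which moving piece a move step picks is what makes the result a set: the loop over the multiset explores every piece that exhibits the required feature.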

We now show that the emptiness problem for minimalist grammars is
Turing-equivalent to the membership problem. First, the emptiness problem can
be reduced to the membership problem simply by replacing every element $(v, l)$
in C by $(\epsilon, l)$: it is then trivial to check that the language of the
former grammar is non-empty if and only if ε belongs to the language of the new
grammar. We can then state that:

Lemma 1. If one can decide whether a sentence s belongs to the language of a
minimalist grammar G then one can decide whether the language of a minimalist
grammar is empty.

In order to prove the converse property we show that the class of languages 
that can be defined with minimalist grammars is closed under intersection with 
recognizable sets of strings. 

Lemma 2. Given a minimalist grammar $G = (V, B, F, C, c)$ and a regular set
of strings $\mathrm{Reg} \subseteq V^*$, there is a minimalist grammar G' whose
language is the intersection of the language defined by G and Reg.

Proof. Let us suppose that Reg is recognized by the deterministic finite
state automaton $A = (V, Q, \delta, q_{init}, Q_f)$ where δ is the transition
function from $V \times Q$ to Q (we identify δ with its homomorphic extension
to the free monoid $V^*$, i.e. we consider that $\delta(\epsilon, q) = q$ and
that, for w from $V^*$, $\delta(w, q)$ is the state that the automaton reaches
when reading w from state q), $q_{init} \in Q$ is the initial state and
$Q_f \subseteq Q$ is the set of final states.

We define $G' = (V, \{d\} \cup B', F', C', d)$ with $B' = B \times Q \times Q$,
$F' = F \times Q \times Q$ and d not in $B'^- \cup B'^+ \cup F'^- \cup F'^+$.
We let

$$C' = \{(\epsilon, {=}(c, q_{init}, q)\, d) \mid q \in Q_f\} \cup \bigcup_{(v, l) \in C} \varphi(v, l)$$

where $\varphi(v, l)$ is defined as follows, with the convention that $e_i$ is
= when $b_i$ is in B and + when $b_i$ is in F:

1. if l is of the form ${=}b\, e_1 b_1 \ldots e_k b_k\, a\, {-}h_1 \ldots {-}h_{n+1}$
then we will have

$$\varphi(v, l) = \{(v, {=}(b, q_0, q)\, e_1(b_1, q_2, q_1) \ldots e_k(b_k, q_{k+1}, q_k)\, (a, q'_0, q'_0)\, {-}(h_1, q'_1, q'_1) \ldots {-}(h_n, q'_n, q'_n)\, {-}(h_{n+1}, q_{k+1}, q)) \mid q, q_0, \ldots, q_{k+1}, q'_0, \ldots, q'_n \in Q \wedge \delta(v, q_1) = q_0\}$$

2. if l is of the form ${=}b\, e_1 b_1 \ldots e_k b_k\, a$ then we will have

$$\varphi(v, l) = \{(v, {=}(b, q_0, q)\, e_1(b_1, q_2, q_1) \ldots e_k(b_k, q_{k+1}, q_k)\, (a, q_{k+1}, q)) \mid q, q_0, \ldots, q_{k+1} \in Q \wedge \delta(v, q_1) = q_0\}$$

3. if l is of the form $a\, {-}h_1 \ldots {-}h_{n+1}$ then we let:

$$\varphi(v, l) = \{(v, (a, q'_0, q'_0)\, {-}(h_1, q'_1, q'_1) \ldots {-}(h_n, q'_n, q'_n)\, {-}(h_{n+1}, q_1, q_0)) \mid q_0, q_1, q'_0, \ldots, q'_n \in Q \wedge \delta(v, q_1) = q_0\}$$

4. if l is of the form a then we let:

$$\varphi(v, l) = \{(v, (a, q_1, q_0)) \mid q_0, q_1 \in Q \wedge \delta(v, q_1) = q_0\}$$

We here give a rough explanation of the definition of $\varphi(v, l)$. If we
look at a lexical entry produced as in the first case of that definition, it
will be used to build a lexical construction of the form

$$((w_k w_{k-1} \ldots w_1 v w_0, (a, q'_0, q'_0)\, {-}(h_1, q'_1, q'_1) \ldots {-}(h_n, q'_n, q'_n)\, {-}(h_{n+1}, q_{k+1}, q)), M)$$

where the word $w_0$ (possibly the empty string) comes from the lexical
construction to which the lexical entry is first merged and the $w_i$ (possibly
empty strings) are the words that are put in front of the lexical construction
through successive merge and move operations. The construction we give
guarantees that $\delta(w_i, q_{i+1}) = q_i$ when $i > 0$ and
$\delta(w_0, q_0) = q$ so that, knowing that $\delta(v, q_1) = q_0$, we have
that $\delta(w_k w_{k-1} \ldots w_1 v w_0, q_{k+1}) = q$. This can help to
understand how the states are related to each other in the positive part of the
list of features of the lexical entry. Afterwards, this lexical construction
can be merged and then moved several times in another lexical construction,
leaving the head of this construction unchanged until the final move operation.
Thus in the negative part of the list of features, the first negative features
just contain pairs of identical states, because they correspond to the fact
that the head of the lexical construction in which it is a moving piece is left
unchanged. Then, when the last move operation happens, the string
$w_k w_{k-1} \ldots w_1 v w_0$ will be put in front of the head and the fact
that, when reading it, the automaton goes from state $q_{k+1}$ to state q must
be consistently used.

Let us now follow this intuition and turn to a sketch of a proof that
$\mathcal{L}(G')$ is equal to $\mathcal{L}(G) \cap \mathrm{Reg}$.

In each of the cases of the definition of $\varphi(v, l)$, if s is in
$\varphi(v, l)$ (we suppose that s is written as in the cases defining the set
$\varphi(v, l)$) then we write range(s) for the pair of states $(q_1, q_0)$.

Given a list of features l from Suff(G') we define rg(l) as follows (we adopt
the convention that e is either = or + and that ι is either − or empty, the
latter when the feature is a negative base feature):

1. if l starts with a positive feature and $l = e(b, q, q_1)\, l'\, \iota(f, q', q_0)$
(l' being a list of features and $\iota(f, q', q_0)$ a negative feature), then
$\mathrm{rg}(l) = (q_1, q_0)$,

2. if l does not start with a positive feature, then
$l = l'\, \iota(f, q_1, q_0)$ and $\mathrm{rg}(l) = (q_1, q_0)$.

Given an element t of d(G') such that $\mathcal{H}(t) = ((w, l), M)$ we write
range(t) for the pair of states defined as follows:

1. if t is of the form s where s is in C' then range(t) = range(s),

2. otherwise range(t) = rg(l).

For a list of features l of G', we write $\bar{l}$ for the list of features of
G such that $\overline{e(b, q, q')\, l'} = eb\, \bar{l'}$.

An easy induction on t' in d(G') proves that if the following properties are
verified:

1. $\mathcal{H}(t') = ((w, l), [(w_1, l_1), \ldots, (w_n, l_n)])$, and

2. $\mathrm{range}(t') = (q, q')$, $\mathrm{rg}(l_1) = (q_1, q'_1)$, ...,
$\mathrm{rg}(l_n) = (q_n, q'_n)$

then we have:

1. $\delta(w, q) = q'$, $\delta(w_1, q_1) = q'_1$, ...,
$\delta(w_n, q_n) = q'_n$,

2. there is t in d(G) such that
$\mathcal{H}(t) = ((w, \bar{l}), [(w_1, \bar{l_1}), \ldots, (w_n, \bar{l_n})])$.

Another simple induction on t in d(G) shows that whenever

$\mathcal{H}(t) = ((w, l), [(w_1, l_1), \ldots, (w_n, l_n)])$

then for every pairs of states $(q, q')$, $(q_1, q'_1)$, ..., $(q_n, q'_n)$
such that $\delta(w, q) = q'$, $\delta(w_1, q_1) = q'_1$, ...,
$\delta(w_n, q_n) = q'_n$ and every $l'$, $l'_1$, ..., $l'_n$ from Suff(G')
such that $\mathrm{rg}(l') = (q, q')$ and $\bar{l'} = l$,
$\mathrm{rg}(l'_1) = (q_1, q'_1)$ and $\bar{l'_1} = l_1$, ...,
$\mathrm{rg}(l'_n) = (q_n, q'_n)$ and $\bar{l'_n} = l_n$, there is t' in d(G')
such that $\mathcal{H}(t') = ((w, l'), [(w_1, l'_1), \ldots, (w_n, l'_n)])$.

These two properties have the consequence that a term t' from d(G') verifies
$\mathcal{H}(t') = ((w, (c, q_{init}, q_f)), \emptyset)$ with $q_f$ in $Q_f$ if
and only if w is in $\mathcal{L}(G) \cap \mathrm{Reg}$. Thus a sentence w is in
$\mathcal{L}(G')$ (i.e. there is t' in d(G') such that
$\mathcal{H}(t') = ((w, d), \emptyset)$) if and only if w is in
$\mathcal{L}(G) \cap \mathrm{Reg}$.

Thus the class of languages defined by minimalist grammars is closed under 
intersection with regular sets. 

Note that this proof gives an actual construction of G' and therefore has the
next lemma as a consequence.

Lemma 3. If the emptiness problem for minimalist grammars is decidable then 
the membership problem for those grammars is decidable. 

Proof. If we want to know whether w belongs to the language defined by G, since
{w} is a regular set, we construct G', the minimalist grammar whose language is
the intersection of the language of G and of {w}. The language of G' is
non-empty if and only if w belongs to the language defined by G.
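The only automaton-specific ingredient of this reduction is a deterministic automaton for the singleton {w}. A minimal sketch of such an automaton is given below; the state numbering, the sink state and the names singleton_dfa and run are assumptions of the sketch. The transition function takes its arguments in the order (letter, state), following the convention δ : V × Q → Q of the proof of Lemma 2.

```python
def singleton_dfa(w):
    """Deterministic automaton recognising exactly {w}.

    States 0..len(w) count the letters read so far; the extra state
    len(w) + 1 is a sink. This encoding is an assumption for illustration.
    """
    sink = len(w) + 1

    def delta(letter, q):
        # Transition function from V x Q to Q.
        if q < len(w) and w[q] == letter:
            return q + 1
        return sink

    return delta, 0, {len(w)}

def run(delta, q, word):
    """Homomorphic extension of delta to words, with delta(ε, q) = q."""
    for letter in word:
        q = delta(letter, q)
    return q

sentence = ["Maria", "will", "speak", "Nahuatl"]
delta, q_init, final_states = singleton_dfa(sentence)
print(run(delta, q_init, sentence) in final_states)
```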

Theorem 1. The emptiness problem and the membership problem for minimalist
grammars are Turing-equivalent.

2 Minimalist Grammars and MELL

We are now going to show that provability in MELL is Turing-equivalent to
the emptiness problem for minimalist grammars. We prove each direction of the
equivalence separately.

First we reduce the provability in MELL to the emptiness of MGs. For this
purpose we use a class of tree automata, k-VATA, introduced in [12], which
generalizes the notion of Vector Addition Systems (VAS), themselves equivalent
to Petri nets. It is proved in [12] that the decidability of the emptiness
problem for k-VATA is equivalent to the decidability of the provability of
sequents in MELL.

Second we show how to represent derivations in MGs as linear λ-terms built
over a certain signature. It is well-known [12] that deciding provability in
MELL is equivalent to deciding the existence of such linear λ-terms.

2.1 Emptiness of k-VATAs Reduced to Emptiness of MGs

A k-VATA is a tuple $(\Sigma, Q, \delta, C_f)$ where:

1. Σ is a tree signature,

2. Q is a finite set of states,

3. δ is a finite set of rules of the form:

$$f(q_1, x_1) \ldots (q_n, x_n) \to (q, \sum_{i=1}^{n} (x_i - z_i) + z)$$

where f is a constant of Σ, the $x_i$ are variables and the $z_i$ and z are
elements of $\mathbb{N}^k$,

4. $C_f$ is a finite subset of $Q \times \mathbb{N}^k$, the accepting
configurations.

For a k-VATA, a configuration is an element of $Q \times \mathbb{N}^k$. A
k-VATA rewrites terms built on the tree signature Σ that can have
configurations as leaves. Given a tree t which is equal to
$C[f(q_1, p_1) \ldots (q_n, p_n)]$ and a rule

$$f(q_1, x_1) \ldots (q_n, x_n) \to (q, \sum_{i=1}^{n} (x_i - z_i) + z)$$

of the considered k-VATA, it rewrites t to
$t' = C[(q, \sum_{i=1}^{n} (p_i - z_i) + z)]$ provided that for all i in
[1; n], $p_i - z_i$ is an element of $\mathbb{N}^k$. In such a case, we write
$t \to_A t'$, and $\to^*_A$ denotes the reflexive and transitive closure of
$\to_A$. The language of a k-VATA $A = (\Sigma, Q, \delta, C_f)$ is the set
$\mathcal{L}(A) = \{t \in T(\Sigma) \mid t \to^*_A (q, p) \wedge (q, p) \in C_f\}$.
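A single rewriting step at the root of a redex can be sketched as follows. The encoding of a rule as a 4-tuple (states, subtractions, target, addition) and the name apply_rule are assumptions of this sketch; vectors of $\mathbb{N}^k$ are plain tuples of integers, and the rule and configurations of the example are hypothetical.

```python
def apply_rule(children, rule):
    """Apply one k-VATA rule to the configurations of the children.

    children: list of (state, vector) pairs, the (q_i, p_i).
    rule: (states, subtractions, target, addition), standing for
          f(q_1, x_1) ... (q_n, x_n) -> (q, sum_i (x_i - z_i) + z).
    Returns the new configuration, or None when the rule is not applicable.
    """
    states, subtractions, target, addition = rule
    if [q for q, _ in children] != list(states):
        return None
    total = list(addition)
    for (_, p), z in zip(children, subtractions):
        diff = [a - b for a, b in zip(p, z)]
        if any(c < 0 for c in diff):      # p_i - z_i must stay in N^k
            return None
        total = [t + d for t, d in zip(total, diff)]
    return (target, tuple(total))

# A hypothetical binary rule with k = 2, z_1 = (1, 0), z_2 = (0, 0), z = (0, 1).
rule = (("q1", "q2"), ((1, 0), (0, 0)), "q", (0, 1))
print(apply_rule([("q1", (2, 1)), ("q2", (0, 3))], rule))
```

The side condition is what distinguishes these automata from plain counting: a step is blocked as soon as one subtraction would leave $\mathbb{N}^k$.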

For a given k, we write 0 (resp. $e_i$) to denote the element of
$\mathbb{N}^k$ whose components are all zero (resp. all zero except the i-th,
which is 1). A k-VATA is in normal form if it has only one accepting
configuration, which is of the form (q, 0), and if all its rules are in one of
the following forms:

1. $c \to (q, e_i)$ for some i in [1; k],

2. $f(q_0, x) \to (q, x - e_i)$ for some i in [1; k],

3. $f((q_1, x_1), (q_2, x_2)) \to (q, x_1 + x_2)$

As shown in [12], the emptiness problem for k-VATA in normal form is as
difficult as for general k-VATA. Furthermore, the theorem is proved by giving
an effective construction.

Theorem 2. Given a k-VATA A, there is a k-VATA B in normal form such
that $\mathcal{L}(A)$ is empty if and only if $\mathcal{L}(B)$ is empty.

We can now reduce the emptiness problem of a k-VATA in normal form to
the emptiness of a minimalist grammar. Suppose that we are given a k-VATA
in normal form $A = (\Sigma, Q, \delta, \{(q_f, 0)\})$; we construct the
following MG $G_A = (\emptyset, Q, [1; k], C, q_f)$ where C contains the
following entries:

- $(\epsilon, q\, {-}i)$ when there is a rule of the form $c \to (q, e_i)$ in δ,

- $(\epsilon, {=}q_1\, {=}q_2\, q)$ when there is a rule of the form
$f((q_1, x_1), (q_2, x_2)) \to (q, x_1 + x_2)$ in δ,

- $(\epsilon, {=}q_0\, {+}i\, q)$ when there is a rule of the form
$f(q_0, x) \to (q, x - e_i)$ in δ.
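The translation from normal-form rules to lexical entries is purely syntactic, as the following sketch suggests. The tagged-tuple encoding of the three rule shapes and the name vata_to_mg_lexicon are assumptions made for illustration, and the rules of the example are hypothetical; the symbols c and f of the rules play no role in the resulting entries.

```python
def vata_to_mg_lexicon(rules):
    """Lexicon of G_A for a k-VATA in normal form.

    Rules are encoded (an assumption of this sketch) as
      ("const", q, i)        for  c -> (q, e_i)
      ("binary", q1, q2, q)  for  f((q1, x1), (q2, x2)) -> (q, x1 + x2)
      ("unary", q0, q, i)    for  f(q0, x) -> (q, x - e_i)
    Every entry carries the empty word "".
    """
    lexicon = set()
    for rule in rules:
        if rule[0] == "const":
            _, q, i = rule
            lexicon.add(("", (q, f"-{i}")))
        elif rule[0] == "binary":
            _, q1, q2, q = rule
            lexicon.add(("", (f"={q1}", f"={q2}", q)))
        elif rule[0] == "unary":
            _, q0, q, i = rule
            lexicon.add(("", (f"={q0}", f"+{i}", q)))
        else:
            raise ValueError(f"not a normal-form rule: {rule!r}")
    return lexicon

rules = [("const", "p", 1), ("binary", "p", "p", "r"), ("unary", "r", "qf", 1)]
print(sorted(vata_to_mg_lexicon(rules)))
```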

We are going to prove that $\mathcal{L}(G_A)$ is empty if and only if
$\mathcal{L}(A)$ is empty by giving an interpretation of the derivations of
$G_A$ as configurations of A. Given t from $d(G_A)$, t can be interpreted as an
element of $\mathbb{N}^k$ when $\mathcal{H}(t)$ is defined: the i-th component
of that vector is the number of occurrences of the feature $-i$ in
$\mathcal{H}(t)$. We write $\mathcal{V}(t)$ for the vector denoted by t when it
is defined. The state of t, denoted by $\mathcal{Q}(t)$, is the element q of Q
such that $\mathcal{H}(t) = ((v, q l), L)$. Note that $\mathcal{Q}(t)$ is not
defined when $\mathcal{H}(t) = ((v, l q l'), L)$ and l is not empty. Thus t in
$d(G_A)$ is interpreted as a configuration of A by
$\mathrm{conf}(t) = (\mathcal{Q}(t), \mathcal{V}(t))$. This configuration is
defined only when $\mathcal{Q}(t)$ is defined (note that when $\mathcal{Q}(t)$
is defined, obviously $\mathcal{H}(t)$ is defined and thus so is
$\mathcal{V}(t)$).

Lemma 4. Given $v \in \mathbb{N}^k$ and $q \in Q$, there is t in $d(G_A)$ such
that $\mathrm{conf}(t) = (q, v)$ if and only if there is a term t' such that
$t' \to^*_A (q, v)$.

Proof. We first remark that whenever conf(t) is defined, then t is in one of
the three following forms:

1. $t = (\epsilon, q\, {-}i)$, where $(\epsilon, q\, {-}i)$ is in C,

2. $t = \mathrm{merge}\ t_2\ (\mathrm{merge}\ t_1\ (\epsilon, {=}q_1\, {=}q_2\, q))$
where $\mathcal{H}(t_1) = ((\epsilon, q_1 l_1), L_1)$ and where
$\mathcal{H}(t_2) = ((\epsilon, q_2 l_2), L_2)$,

3. $t = \mathrm{move}(\mathrm{merge}\ u\ (\epsilon, {=}q_0\, {+}i\, q))$ where
$\mathcal{H}(u) = ((\epsilon, q_0 l), L)$ and the i-th component of
$\mathcal{V}(u)$ is strictly positive.

We now prove the existence of t' by induction on t.

In case $t = (\epsilon, q\, {-}i)$, then, by definition of $G_A$, there is a
rule in δ which is of the form $c \to (q, e_i)$. Then it suffices to take
t' = c.

In case $t = \mathrm{merge}\ t_2\ (\mathrm{merge}\ t_1\ (\epsilon, {=}q_1\, {=}q_2\, q))$
then, by induction hypothesis, we have the existence of $t'_1$ and $t'_2$ such
that $t'_i \to^*_A \mathrm{conf}(t_i)$. Moreover, by definition of C, there is
a rule of the form $f((q_1, x_1), (q_2, x_2)) \to (q, x_1 + x_2)$ in δ. We then
let t' be $f\ t'_1\ t'_2$.

In case $t = \mathrm{move}(\mathrm{merge}\ u\ (\epsilon, {=}q_0\, {+}i\, q))$,
then, by induction hypothesis, there is u' such that
$u' \to^*_A \mathrm{conf}(u)$. Furthermore, we know that the i-th component of
$\mathcal{V}(u)$ is strictly positive, and that, by definition of C, there is
in δ a rule of the form $f(q_0, x) \to (q, x - e_i)$. We can then choose t' to
be f(u').

The proof of the converse property uses a similar induction.

The parallel that is drawn between minimalist grammars and k-VATA allows
us to give a negative answer to the conjecture raised in [13] that MGs (without
SMC) define only semi-linear languages. To prove it we use the notion of k-VAS
(k-Vector Addition Systems). A k-VAS can be seen as a k-VATA whose signature
contains only nullary and unary operators, and, given a state q, the set of
vectors that are accessible at q for a k-VAS A is
$\mathrm{Acc}(A, q) = \{v \mid \exists t.\ t \to^*_A (q, v)\}$. It is known that
the sets of the form Acc(A, q) may not be semi-linear [14]. As for k-VATA,
there is a normal form for k-VAS, where the rules are of the following form:

1. $c \to (q, 0)$

2. $f(q_1, x) \to (q_2, x - e_i)$ for some i in [1; k],

3. $f(q_1, x) \to (q_2, x + e_i)$ for some i in [1; k]

The important property is that if A is a k-VAS and q is a state of A, then
there is a k-VAS in normal form B and a state q' of B such that
Acc(A, q) = Acc(B, q').

So given a k-VAS in normal form A and its final state p, we can define the
MG $G_A = ([1; k], Q \cup \{d\}, [1; k], C, d)$ whose lexicon C contains the
following entries:

1. $(\epsilon, q)$ if there is a rule $c \to (q, 0)$ in δ,

2. $(\epsilon, {=}q_1\, {+}i\, q_2)$ if there is a rule
$f(q_1, x) \to (q_2, x - e_i)$ in δ,

3. $(\epsilon, {=}q_1\, q_2\, {-}i)$ if there is a rule
$f(q_1, x) \to (q_2, x + e_i)$ in δ,

4. $(\epsilon, {=}p\, d)$,

5. $(i, {=}d\, {+}i\, d)$ for all i in [1; k].

Similarly to the proof of Lemma 4 it can be shown that whenever v is accessible
at q in A then there is t such that conf(t) = (q, v). Then the lexical entries
$(\epsilon, {=}p\, d)$ and $(i, {=}d\, {+}i\, d)$ (where i is in [1; k])
transform the vector v into a word of [1; k]* such that, for all i in [1; k],
it contains exactly $v_i$ occurrences of i (if $v_i$ is the i-th component of
v), so that the language defined by $G_A$ is the set of elements of [1; k]*
whose Parikh image is in Acc(A, p). Thus the language of $G_A$ is semi-linear
if and only if the set of vectors accessible at p in A is semi-linear. Thus, we
have the following theorem.
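The last step of this argument relies on the Parikh image of a word, which only records how many times each letter occurs. A minimal sketch over the alphabet [1; k] follows; the name parikh and the encoding of words as lists of integers are assumptions of the sketch.

```python
from collections import Counter

def parikh(word, k):
    """Parikh image of a word over [1; k]: component i counts the letter i."""
    counts = Counter(word)
    return tuple(counts[i] for i in range(1, k + 1))

# Two words with the same letters in a different order share their image.
print(parikh([1, 3, 1, 2], 3), parikh([3, 2, 1, 1], 3))
```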

Theorem 3. The class of languages defined by MGs is not semi-linear. 

2.2 Representing MG Derivations as Proofs in MELL 

We here give an account of the derivations of an MG with the linear λ-terms
built over a certain signature. It is known (cf. [12]) that finding such
λ-terms is in general Turing-equivalent to provability in MELL. This encoding
thus completes the proof that the membership problem for MGs is
Turing-equivalent to provability in MELL.

For a given MG, $G = (V, B, F, C, c)$, we define $\Sigma_G$ which declares the
following set of atomic types:

1. e(l) if there is (w, l) in C,

2. d(l) if l is an element of Suff(G),

3. h(l) if l is an element of Move(G)

Even though we use a predicate-like notation, note that since C, Suff(G) and
Move(G) are finite, there are finitely many types that are declared in
$\Sigma_G$. The type e(l) represents the type of a lexical entry, d(l)
represents the type of a derivation whose head is of the form (w, l) and h(l)
is the type of a moving piece of the form (w, l). We make the distinction
between e(l) and d(l) so as to know how to interpret merge in a homomorphic
way.

$\Sigma_G$ also declares the following constants:

1. $(w, l) : e(l)$ if (w, l) is in C,

2. $\mathrm{merge}[k(a l_1), k'({=}a l_2)] : k(a l_1) \multimap k'({=}a l_2) \multimap h(l_1) \multimap d(l_2)$
where $l_1$ is not empty and with k, k' in {d; e},

3. $\mathrm{merge}[k(a), k'({=}a l_2)] : k(a) \multimap k'({=}a l_2) \multimap d(l_2)$
with k, k' in {d; e},

4. $\mathrm{move}[h({-}a l_1), d({+}a l_2)] : (h({-}a l_1) \multimap d({+}a l_2)) \multimap h(l_1) \multimap d(l_2)$
where $l_1$ is not empty,

5. $\mathrm{move}[h({-}a), d({+}a l)] : (h({-}a) \multimap d({+}a l)) \multimap d(l)$

We will show that there are closed terms of type k(c) (with k in {d; e}) if
and only if $\mathcal{L}(G)$ is not empty. Terms of d(G) that can be
interpreted with the function $\mathcal{H}$ are represented as terms built on
$\Sigma_G$ whose types are d(l) or e(l) and whose free variables have types of
the form h(l).

Lemma 5. There is $t \in \Lambda^{k(l)}_{\Sigma_G}$ (with k being either d or
e) such that $FV(t) = \{x_1^{h(l_1)}; \ldots; x_n^{h(l_n)}\}$ if and only if
there is t' in d(G) such that
$\mathcal{H}(t') = ((w, l), [(w_1, l_1) \ldots (w_n, l_n)])$.

Proof. We first construct t' by induction on t. We suppose without loss of
generality that t is in normal form.

If t = (w, l) then it suffices to take t' = (w, l).

If $t = \mathrm{merge}[k(a l_1), k'({=}a l_2)]\ t_1\ t_2\ t_3$, then because
there is no constant that has the type $h(l_1)$ as a conclusion and because
free variables all have a type of the form h(l), it is necessary that, for some
i, $t_3 = x_i^{h(l_i)}$ and $h(l_i) = h(l_1)$. Thus, we have that
$t_1 \in \Lambda^{k(a l_1)}_{\Sigma_G}$ with
$FV(t_1) = \{x_{i_1}^{h(l_{i_1})}; \ldots; x_{i_p}^{h(l_{i_p})}\}$,
$t_2 \in \Lambda^{k'({=}a l_2)}_{\Sigma_G}$ with
$FV(t_2) = \{x_{j_1}^{h(l_{j_1})}; \ldots; x_{j_q}^{h(l_{j_q})}\}$ and
$(\{i_1; \ldots; i_p\}, \{j_1; \ldots; j_q\}, \{i\})$ forms a partition of
[1; n]. By induction hypothesis, we get the existence of $t'_1$ and $t'_2$
verifying the right properties and it suffices to take
$t' = \mathrm{merge}\ t'_1\ t'_2$.

If $t = \mathrm{merge}[k(a), k'({=}a l_2)]\ t_1\ t_2$, then we proceed
similarly to the previous case.

If $t = \mathrm{move}[h({-}a l), d({+}a l')]\ (\lambda x^{h({-}a l)}.t_1)\ t_2$
then, similarly to the previous case, we have that $t_2$ must be one of the
$x_i^{h(l_i)}$. We suppose, without loss of generality, that
$t_2 = x_1^{h(l_1)}$ and then we have that
$t_1 \in \Lambda^{d({+}a l')}_{\Sigma_G}$ with
$FV(t_1) = \{x_2^{h(l_2)}; \ldots; x_n^{h(l_n)}; x^{h({-}a l)}\}$. Then we
obtain a $t'_1$ from $t_1$ by using the induction hypothesis and it suffices to
take $t' = \mathrm{move}(t'_1)$ by assuming that move is operating on a moving
piece of the form $(v, {-}a l)$ which by induction hypothesis must exist.

If $t = \mathrm{move}[h({-}a), d({+}a l)]\ \lambda x.t_1$ then we proceed in a
way similar to the previous case.

The converse does not present more difficulty and is left to the reader.

This Lemma together with Lemma 4 answers the question of the mathematical
nature of the derivations of minimalist grammars. It shows that these
derivations can be seen as closed linear λ-terms of type d(c) or e(c). Thus,
with such a representation, checking whether such a derivation is correct does
not amount to computing whether it can be interpreted as a string, but merely
amounts to type checking.

Example 3. We show here the representation of the derivation of Example 1 as
a linear λ-term:

               move[h(-case), d(+case c)]
                           |
            λx. merge[d(v), e(=v +case c)]
                  /                    \
   merge[e(d -case), d(=d v)]           (will, =v +case c)
      /            |            \
(Maria, d -case)   merge[e(d), e(=d =d v)]   x
                     /              \
           (Nahuatl, d)        (speak, =d =d v)
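Since checking such a derivation amounts to type checking, the term of Example 3 can be verified mechanically. The sketch below builds the types of the merge and move constants from the schemes of $\Sigma_G$ and infers the type of the term. The encoding (atomic types as strings, arrows as tagged tuples, terms as nested tuples) and all function names are assumptions of the sketch, and linearity of variables is deliberately not checked.

```python
def arrow(*types):
    """Right-nested linear arrow: arrow(A, B, C) is A -o (B -o C)."""
    result = types[-1]
    for t in reversed(types[:-1]):
        result = ("-o", t, result)
    return result

def merge_type(k, a, l1, k2, l2):
    """Type of merge[k(a l1), k2(=a l2)]; l1 is "" when it is empty."""
    left = f"{k}({(a + ' ' + l1).strip()})"
    right = f"{k2}(={a} {l2})"
    if l1:
        return arrow(left, right, f"h({l1})", f"d({l2})")
    return arrow(left, right, f"d({l2})")

def move_type(a, l1, l2):
    """Type of move[h(-a l1), d(+a l2)]; l1 is "" when it is empty."""
    piece = f"h({('-' + a + ' ' + l1).strip()})"
    inner = ("-o", piece, f"d(+{a} {l2})")
    if l1:
        return arrow(inner, f"h({l1})", f"d({l2})")
    return arrow(inner, f"d({l2})")

def type_of(term, env=None):
    """Simple type inference; linearity of variables is not checked here."""
    env = env or {}
    kind = term[0]
    if kind == "const":
        return term[1]
    if kind == "var":
        return env[term[1]]
    if kind == "lam":
        _, name, var_type, body = term
        return ("-o", var_type, type_of(body, {**env, name: var_type}))
    _, fun, arg = term                      # application
    fun_type = type_of(fun, env)
    if fun_type[0] != "-o" or fun_type[1] != type_of(arg, env):
        raise TypeError("ill-typed application")
    return fun_type[2]

def app(f, *args):
    for a in args:
        f = ("app", f, a)
    return f

def lexical(features):
    """Constant for a lexical entry with the given feature string."""
    return ("const", f"e({features})")

maria, will = lexical("d -case"), lexical("=v +case c")
nahuatl, speak = lexical("d"), lexical("=d =d v")

vp = app(("const", merge_type("e", "d", "", "e", "=d v")), nahuatl, speak)
clause = app(("const", merge_type("e", "d", "-case", "d", "v")),
             maria, vp, ("var", "x"))
tp = app(("const", merge_type("d", "v", "", "e", "+case c")), clause, will)
term = app(("const", move_type("case", "", "c")), ("lam", "x", "h(-case)", tp))

print(type_of(term))
```

Swapping two arguments of a merge, or dropping the variable x, makes type_of raise a TypeError, which is the sense in which an incorrect derivation is rejected without computing any string.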


On the other hand, the derivation of Example 2 can be represented by three
different linear λ-terms (for the sake of concision we erase the square
brackets [α, β] of the constants move[α, β] and merge[α, β]). The three terms
share the subterm

M = merge (merge (γ₂, a₂ -c) (merge (merge (δ, d) (γ₁, =d a₁ -c -c)) (α, =a₁ =a₂ b) y₁) x) (β, =b +c +c +c b)

and differ only in the order in which the variables x, y₁ and y₂ are bound:

1. move (λy₂. move (λy₁. move (λx. M)) y₂)

2. move (λy₂. move (λx. move (λy₁. M) y₂))

3. move (λx. move (λy₂. move (λy₁. M) y₂))



In this presentation of minimalist derivations λ-variables represent the
moving pieces of a derivation. When a derivation $t_1$ is merged to another
derivation $t_2$ and must be moved afterwards then the new derivation is of the
form merge $t_1$ $t_2$ x where the λ-variable x materialises as a moving piece.

Each time a move operation is applied, it is applied to a third-order term of
the form λx.t where x indicates which moving piece is actually moved. When a
constant of the form move[α, β] has two arguments, it means that the moving
piece that the move operation has moved still has to be moved, and then the
second argument is a λ-variable that materialises the actualisation of this
moving piece.

In the representation of minimalist derivations we propose, it becomes explicit
that the move operation corresponds to binding some variable, but we can go
a little further in the logical interpretation of move. Indeed, it would be
possible to define a signature $\Pi_G$ which only contains constants
representing the lexical entries of G and no operator representing move or
merge. The types used by $\Pi_G$ are: d(la) where la is such that there is
$(w, l_1 l a l_2)$ in C and l is either starting with an element of $F^+$ or is
empty. For every entry $(w, l a\, {-}f_1 \ldots {-}f_n)$ in C, $\Pi_G$ contains
constants of type

$$((\ldots(((g(la) \multimap d({+}f_1 l_1)) \multimap g(l_1)) \multimap d({+}f_2 l_2)) \multimap g(l_2)) \ldots) \multimap d({+}f_n l_n)) \multimap g(l_n)$$

for every possible atomic type of the form $d({+}f_i l_i)$ and where g(l) is
equal to $d(a_1) \multimap \cdots \multimap d(a_k) \multimap d(l'b)$ if l is of
the form ${=}a_1 \ldots {=}a_k\, l' b$ and l' is either starting with some
element of $F^+$ or is empty. The idea is that the $d({+}f_i l_i)$ and $l_i$
represent the features that the head has to license when the i-th movement of
the entry happens.

Example 4. If we applied such a transformation to the grammar we used in our
examples then we would get the following type assignment for the lexical entries
for the derivations we showed:

(Maria, d -case) : $(d(d) \multimap d({+}case\ c)) \multimap d(c)$

(will, =v +case c) : $d(v) \multimap d({+}case\ c)$

(Nahuatl, d) : $d(d)$

(speak, =d =d v) : $d(d) \multimap d(d) \multimap d(v)$

Then the derivation is represented by the linear λ-term:

        (Maria, d -case)
               |
     λx. (will, =v +case c)
               |
        (speak, =d =d v)
           /        \
   (Nahuatl, d)      x

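For the second example, we may represent the derivations given as examples
with the following constants:

$\gamma_2 : (((d(a_2) \multimap d({+}c\,{+}c\,b)) \multimap d({+}c\,b)) \multimap d({+}c)) \multimap d(b)$

$\gamma_2 : (((d(a_2) \multimap d({+}c\,{+}c\,{+}c\,b)) \multimap d({+}c\,{+}c\,b)) \multimap d({+}c)) \multimap d(b)$

$\gamma_2 : (((d(a_2) \multimap d({+}c\,{+}c\,{+}c\,b)) \multimap d({+}c\,{+}c\,b)) \multimap d({+}c\,{+}c)) \multimap d({+}c\,b)$

$\gamma_1 : ((d(d) \multimap d(a_1)) \multimap d({+}c\,{+}c\,{+}c\,b)) \multimap d({+}c\,{+}c\,b)$

$\gamma_1 : ((d(d) \multimap d(a_1)) \multimap d({+}c\,{+}c\,b)) \multimap d({+}c\,b)$

$\gamma_1 : ((d(d) \multimap d(a_1)) \multimap d({+}c\,b)) \multimap d(b)$

$\beta : d(b) \multimap d({+}c\,{+}c\,{+}c\,b)$

$\alpha : d(a_1) \multimap d(a_2) \multimap d(b)$

$\delta : d(d)$

With these constants, the derivations can be represented by the linear λ-terms: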
[Figure: three linear λ-terms over $\Pi_G$, drawn as trees built from the
constants γ₁, γ₂, β, α and δ and the bound variables x, y₁ and y₂.]


We can now understand the technical contributions of move and merge by
comparing $\Sigma_G$ and $\Pi_G$. First, we remark that in $\Pi_G$ each entry conveys explicitly
the context in which it is used; in particular, it specifies the characteristics of the
head at each step of its movements. The drawback is that $\Pi_G$ has a
size in $O(|G|^{n+1})$, where $n$ is the maximum number of movements that
a moving piece can make, whereas $\Sigma_G$ is much more compact and has a size in
$O(|G|^2)$. Furthermore, by making the types of the constants $merge[k(a l_1), k'(=a l_2)]$,
$move[h(-a l_1), d(+a l_2)]$ and $move[h(-a), d(+a l)]$ polymorphic in $a$, $l_1$,
$l_2$ or $l$, we obtain a grammar whose size is linear in the size of
$G$. This polymorphism also has the advantage that we can add new entries
without having to change the grammar in any respect. We can also use a notion
of polymorphism for $\Pi_G$, but it needs to be a stronger notion. Indeed, while
in $\Sigma_G$ polymorphism instantiates atomic types differently, in $\Pi_G$, because we
use a function $g(l)$ that gives a complex type depending on the shape of $l$,
polymorphism requires a notion of functions from feature strings to types.

A more interesting remark concerning the difference between $\Pi_G$ and $\Sigma_G$
concerns their respective orders. Indeed, $\Sigma_G$ is a third order signature whereas
$\Pi_G$ is a signature whose order is between $n + 1$ and $n + 2$, where $n$ is the
maximum number of movements a moving piece can make. If $G$ did not contain any
moving feature, then both $\Sigma_G$ and $\Pi_G$ would be second order signatures. Thus
movement is responsible for the higher order types in $\Sigma_G$ and $\Pi_G$. We can see
the move operation as being responsible for transforming higher order into third
order. Transforming higher order into third order is quite usual in intuitionistic
implicative logics. Indeed, in minimal logic as well as in intuitionistic implicative
linear logic, a sequent $\Gamma \vdash \alpha$ which may contain formulae of any order can
be transformed into a sequent $\Delta \vdash \beta$ which contains only formulae that are at
most third order and which is provable if and only if $\Gamma \vdash \alpha$ is provable. But,
interestingly, if we use the construction that underpins this property on $\Pi_G$, we
will not obtain $\Sigma_G$ as a result. This is due to the fact that move and merge also
break down the complexity of the polymorphism that would be necessary to make
$\Pi_G$ be of a reasonable size. The interest of comparing $\Pi_G$ and $\Sigma_G$ is
to show how the linguistic ideas that lead to the merge and move operations make
drastic simplifications in the mathematical objects used to represent derivations.
It also gives an opening towards the understanding of the mathematical nature
of these simplifications.

3 Interpreting Minimalist Derivations 

In this section we show how to interpret the terms of the signature $\Sigma_G$ so as to
obtain the string that they analyse. In the previous section we have already
remarked that these terms explicitly represent, via variable binding, which
moving piece is actually moved when a move operation is used. This allows
us to give a homomorphic interpretation of these terms that yields the unique
string that each of them represents. Thus this representation of derivations makes the
linearisation of the derivations independent from their semantic
interpretation. This should make it easier to describe the interface between syntax
and semantics for MGs.

We have already seen that interpreting trees of $d(G)$, for an MG $G$, requires
that we have a list of moving pieces. In our homomorphic interpretation of the
terms built on $\Sigma_G$, we will also need such a list. This list will be used very
similarly to the context that is introduced in uni for semantic purposes. This 
shows that the mapping from minimalist derivations to the surface structure is 
far from being trivial. 

In order to define our context we need to work in a system with at least as
much computational power as Gödel's system T. We do not give all the
implementation details because they are not of much interest. We wish to convince the
reader that the homomorphic interpretation of minimalist derivations requires
a rather sophisticated implementation, and technical details would obscure the
reasons why it is so. A syntactic context is defined as a pair $(L, n)$ where:

1. L is a list of pairs ( w,p ) with w belonging to W* and p being an integer, 

2. n is an integer. 

Integers are used here to give names to moving pieces. They make it possible
to distinguish between several moving pieces. Thus, $L$ is a list that associates
specific integers to moving pieces so as to retrieve them, and $n$ is a fresh integer
that makes it possible to extend and search the list. We use the following constants
and operations on syntactic contexts:

- the empty list is $[]$,

- concatenation $(L_1, p_1) \bullet (L_2, p_2) = (L_1 @ L_2, \max(p_1, p_2))$, where $@$ is the
operation of list concatenation,

- adding an element $push(e, (L, p)) = (e :: L, p)$, where $::$ is the operation that
adds an element to a list,

- incrementation $incr(L, p) = (L, p + 1)$,

- getting a fresh name $fresh(L, p) = p + 1$,

- selection $sel(k, (L, p))$, which returns the string associated to $k$ in $L$.
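
The operations on syntactic contexts listed above can be sketched in Python as follows. This is only an illustration: the representation of contexts as tuples, the name `concat` for the $\bullet$ operation, and the behaviour of `sel` when a name is absent (raising `KeyError`) are assumptions that the text does not fix.

```python
from typing import List, Tuple

# A syntactic context is a pair (L, p): L lists (string, name) pairs and
# p is the counter from which fresh names are derived.
Context = Tuple[List[Tuple[str, int]], int]

EMPTY: Context = ([], 0)


def concat(c1: Context, c2: Context) -> Context:
    """(L1, p1) . (L2, p2) = (L1 @ L2, max(p1, p2))."""
    return (c1[0] + c2[0], max(c1[1], c2[1]))


def push(e: Tuple[str, int], c: Context) -> Context:
    """push(e, (L, p)) = (e :: L, p)."""
    return ([e] + c[0], c[1])


def incr(c: Context) -> Context:
    """incr(L, p) = (L, p + 1)."""
    return (c[0], c[1] + 1)


def fresh(c: Context) -> int:
    """fresh(L, p) = p + 1."""
    return c[1] + 1


def sel(k: int, c: Context) -> str:
    """Return the string associated to the name k in L (first match)."""
    for w, name in c[0]:
        if name == k:
            return w
    raise KeyError(k)  # assumption: the text leaves this case unspecified
```

For instance, `push(("Maria", fresh(EMPTY)), EMPTY)` stores the moving piece "Maria" under the name 1 while leaving the counter at 0; a later `incr` makes the next fresh name 2.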

We now give the interpretation of the types of $\Sigma_G$:

- $e(l)$ and $d(l)$ are interpreted as being of type $str \times \gamma$, where $str$ is the type of
strings and $\gamma$ that of syntactic contexts.

- $h(l)$ is interpreted as being the type $N$, the type of integers.

For the sake of simplicity, we write $\lambda(x_1, x_2).t$ and use some let ... and ... in
constructions instead of the usual projection operators $\pi_1$ and $\pi_2$. Furthermore,
as we are interested in the language of an MG and as derivations are interpreted
as pairs of type $str \times \gamma$, we introduce two new constants in the signature: $realise_1$
of type $d(c) \multimap s$ and $realise_2$ of type $e(c) \multimap s$, whose purpose is to get the
string part of a valid derivation. When $t$ represents a derivation of the MG $G$
as a term built on the signature $\Sigma_G$, the resulting string is obtained by applying a
homomorphism $\mathcal{I}$ to the term $realise_1(t)$ or $realise_2(t)$, depending on whether $t$
has type $d(c)$ or $e(c)$. In what follows, as we will need to concatenate strings,
we write $w \bullet w'$ for the concatenation of two terms that are strings, in order to
avoid confusion with term application.

The interpretation of the constants of $\Sigma_G$ is as follows (in what follows $k(l)$ and $k'(l)$
may denote either $d(l)$ or $e(l)$):

1. for $realise_1 : d(c) \multimap s$ we have $\mathcal{I}(realise_1) = \lambda(w, L).w$

2. for $realise_2 : e(c) \multimap s$ we have $\mathcal{I}(realise_2) = \lambda(w, L).w$

3. for $(w, l) : e(l)$, we have $\mathcal{I}((w, l)) = (w, ([], 0))$

4. for $merge[k(a l_1), k'(=a l_2)] : k(a l_1) \multimap k'(=a l_2) \multimap h(l_1) \multimap d(l_2)$ where $l_1$ is
not empty we have:

$\mathcal{I}(merge[k(a l_1), k'(=a l_2)]) = \lambda(w_1, s_1)(w_2, s_2)p.(w_2, push((w_1, p), s_1 \bullet s_2))$

5. for $merge[k(a), d(=a l_2)] : k(a) \multimap d(=a l_2) \multimap d(l_2)$ we have:

$\mathcal{I}(merge[k(a), d(=a l_2)]) = \lambda(w_1, s_1)(w_2, s_2).(w_1 \bullet w_2, s_1 \bullet s_2)$

6. for $merge[k(a), e(=a l_2)] : k(a) \multimap e(=a l_2) \multimap d(l_2)$ we have:

$\mathcal{I}(merge[k(a), e(=a l_2)]) = \lambda(w_1, s_1)(w_2, s_2).(w_2 \bullet w_1, s_1 \bullet s_2)$

7. for $move[h(-a l_1), d(+a l_2)] : (h(-a l_1) \multimap d(+a l_2)) \multimap h(l_1) \multimap d(l_2)$ (by
definition $l_1$ is not empty) we have:

$\mathcal{I}(move[h(-a l_1), d(+a l_2)]) = \lambda f p. f\,p$

8. for $move[h(-a), d(+a l_2)] : (h(-a) \multimap d(+a l_2)) \multimap d(l_2)$ we have:

$\mathcal{I}(move[h(-a), d(+a l_2)]) = \lambda f.$ let $(\_, s) = (f\,0)$
and $n = fresh(s)$
and $(w, s') = (f\,n)$
in $(sel(n, s') \bullet w, incr(s'))$

The only operation that is complicated is $\mathcal{I}(move[h(-a), d(+a l_2)])$, because
this is the one where a moving piece stops moving and is incorporated into the
head. First we retrieve the context $s$ by giving 0 as a dummy argument to $f$. This
allows us to obtain $n$ as a fresh name to give to the moving piece. We then apply
$f$ to $n$ and we get the string of the head and a context $s'$ which associates $n$ to
the moving piece that will be incorporated into the head. The operation $sel(n, s')$
retrieves the string of that moving piece, and the context is incremented so that
in a later use of the context that moving piece will not be chosen. We could have
deleted it from the list, but it is not necessary here. Deletion in a list is in general
a more complex operation than selection in the $\lambda$-calculus.
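
To make the role of the context concrete, here is a minimal Python sketch of items 3 to 8 of the interpretation, applied to the sentence "Maria will speak Nahuatl". The function names, the use of a space when concatenating strings, and the particular term passed to `move_final` (a plausible term over $\Sigma_G$ for this sentence) are assumptions made for illustration; they are not taken from the text.

```python
def cat(w1, w2):
    """String concatenation (the text's w . w'), skipping empty strings."""
    return " ".join(w for w in (w1, w2) if w)

# Syntactic contexts (L, p) and their operations, as defined in the text.
def concat(c1, c2): return (c1[0] + c2[0], max(c1[1], c2[1]))
def push(e, c): return ([e] + c[0], c[1])
def incr(c): return (c[0], c[1] + 1)
def fresh(c): return c[1] + 1
def sel(k, c): return next(w for w, name in c[0] if name == k)

def lex(w):                                   # item 3: lexical entry
    return (w, ([], 0))

def merge_moving(d1, d2, p):                  # item 4: d1 will move later
    (w1, s1), (w2, s2) = d1, d2
    return (w2, push((w1, p), concat(s1, s2)))

def merge_derived(d1, d2):                    # item 5: selector is derived
    (w1, s1), (w2, s2) = d1, d2
    return (cat(w1, w2), concat(s1, s2))

def merge_lexical(d1, d2):                    # item 6: selector is lexical
    (w1, s1), (w2, s2) = d1, d2
    return (cat(w2, w1), concat(s1, s2))

def move_step(f, p):                          # item 7: intermediate move
    return f(p)

def move_final(f):                            # item 8: last move of a piece
    _, s = f(0)            # dummy run, only to read the context
    n = fresh(s)           # fresh name for the moving piece
    w, s2 = f(n)           # real run: the piece is stored under n
    return (cat(sel(n, s2), w), incr(s2))

def realise(d):                               # items 1 and 2
    return d[0]

# Assumed term: move(lambda x. merge(merge(Maria, merge(Nahuatl, speak), x), will))
def body(x):
    vp = merge_lexical(lex("Nahuatl"), lex("speak"))
    vp = merge_moving(lex("Maria"), vp, x)
    return merge_lexical(vp, lex("will"))

sentence = realise(move_final(body))
```

The function `body` is evaluated twice by `move_final`, once with the dummy name 0 and once with the fresh name, which mirrors the two applications of $f$ in item 8.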

This usage of the context of type $\gamma$ is typical of the continuation-passing style
of programming. Even though it is technical, it is quite easy to prove that the
set

$\{w \mid \exists t \in \Lambda^{d(c)}.\ t \text{ is closed and } \mathcal{I}(realise_1\ t) = w\}$

$\cup$

$\{w \mid \exists t \in \Lambda^{e(c)}.\ t \text{ is closed and } \mathcal{I}(realise_2\ t) = w\}$

is equal to $L(G)$; the induction is very similar to the one we used to prove
Lemma |U 

In this paper, for the sake of clarity, we deliberately use the simplest possible notion of
minimalist grammars. In particular, we omitted weak features, which
are necessary in most reasonable linguistic models. At the level of the
derivation structures, the addition of weak features changes almost nothing;
changes occur at the level of the interpretation. We will not go into the details
of a possible homomorphic interpretation of derivations with weak features, but
we can say that it is much more involved than the homomorphism $\mathcal{I}$.

3.1 The Shortest Move Constraint 

Now we can see that the shortest move constraint can be expressed on
minimalist derivations represented as terms built on $\Sigma_G$ as the constraint that in any
subterm of such a term there is at most one free variable having a type of the
form $h(-a l)$ for each licensing feature $a$. With the Curry-Howard isomorphism,
we can see linear $\lambda$-terms as proofs in implicative linear logic, which establish
judgements of the form $\Gamma \vdash \alpha$ where $\Gamma$ is a multiset of linear types and $\alpha$ is a
linear type. The restriction that there is at most one free variable having a type
of the form $h(-a l)$ is interpreted in implicative linear logic as constraining the
possible judgements to be of the form $\Gamma \vdash \alpha$ where $\Gamma$ contains at most one
occurrence of a formula of the form $h(-a l)$ for each $a$. This means that the size of
each possible $\Gamma$, that is, the number of types it contains, is bounded by the
number of movement features. Thus there are finitely many possible $\Gamma$ that obey this
constraint. This is the key property that makes the languages of minimalist grammars with the
shortest move constraint multiple context-free languages (MCFLs).

Indeed, because of the finiteness of the possible $\Gamma$ and of the subformula
property, there are only finitely many possible judgements that may have to be
proved. We can therefore represent the set of all proofs in an algebraic setting; it
suffices to take all the possible instances of the elimination rules and of the
introduction rules of the intuitionistic implicative linear logic. For a given signature
$\Sigma_G$, we do this by building the following multi-sorted tree signature $SMC(\Sigma_G)$:

1. the types are $[h(-a_1 l_1), \ldots, h(-a_n l_n) \vdash \alpha]_I$ and $[\Gamma \vdash \alpha]_E$ where $-a_i l_i$ is in
$Move(G)$, the $a_i$ are pairwise distinct and $\alpha$ is a subformula of the type of a
constant in $\Sigma_G$,

2. $c : [\vdash \alpha]_E$ for each constant $c : \alpha$ of the signature ($c$ can be either a lexical
entry or one of the constants representing the move and merge operations),

3. $E_1 : [\Gamma \vdash \alpha \multimap \beta]_E \multimap [\Delta \vdash \alpha]_E \multimap [\Gamma, \Delta \vdash \beta]_E$ if $\alpha$ is atomic,

4. $E_2 : [\Gamma \vdash \alpha \multimap \beta]_E \multimap [\Delta \vdash \alpha]_I \multimap [\Gamma, \Delta \vdash \beta]_E$ if $\alpha$ is not atomic,

5. $I_1 : [\Gamma, \alpha \vdash \beta]_E \multimap [\Gamma \vdash \alpha \multimap \beta]_I$ if $\beta$ is atomic,

6. $I_2 : [\Gamma, \alpha \vdash \beta]_I \multimap [\Gamma \vdash \alpha \multimap \beta]_I$ if $\beta$ is not atomic.

To be rigorous, we should have, similarly to the definitions of the constants
representing merge and move in the signature $\Sigma_G$, several versions of the
constants $E_1$, $E_2$, $I_1$ and $I_2$ for their various possible typings. But for the sake of
notational simplicity, we simply use those four constants and consider
that they have several types.

In order to have a unique representation of each proof, we have annotated
types with $I$ or $E$: types with an $I$ as subscript are types of proofs that finish
with an introduction rule, whereas those with an $E$ cover the other cases. The
representation we have chosen corresponds to $\beta$-normal and $\eta$-long terms built
on $\Sigma_G$.

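We now give the interpretation of the terms built on $SMC(\Sigma_G)$ with a
homomorphism $\mathcal{D}$ that retrieves the $\lambda$-term of $\Sigma_G$ that is represented:

1. $\mathcal{D}([\alpha_1, \ldots, \alpha_n \vdash \alpha]_E) = \mathcal{D}([\alpha_1, \ldots, \alpha_n \vdash \alpha]_I) = \alpha_1 \multimap \cdots \multimap \alpha_n \multimap \alpha$

2. $\mathcal{D}(c) = c$

3. $\mathcal{D}(E_1) = \lambda f g x_1 \ldots x_n y_1 \ldots y_p. f x_1 \ldots x_n (g y_1 \ldots y_p)$

4. $\mathcal{D}(E_2) = \lambda f g x_1 \ldots x_n y_1 \ldots y_p. f x_1 \ldots x_n (g y_1 \ldots y_p)$

5. $\mathcal{D}(I_1) = \lambda f. f$

6. $\mathcal{D}(I_2) = \lambda f. f$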

We can also define a homomorphism $\mathcal{J}$ that transforms a tree $t$ of $SMC(\Sigma_G)$
into the same string as the one obtained with $\mathcal{I}$, but with the property that every constant is
interpreted as an affine $\lambda$-term. The idea behind the definition of $\mathcal{J}$ is that we
can represent a $p$-tuple of strings $(s_1, \ldots, s_p)$ that are used to build a string by
a term of the form $P = \lambda f. f s_1 \ldots s_p$. Then, as an example, we can use $P$ to form
the string $w_1 s_1 \cdots w_p s_p w_{p+1}$ simply with the following $\lambda$-term:
$P(\lambda x_1 \ldots x_p. w_1 \bullet x_1 \bullet \cdots \bullet w_p \bullet x_p \bullet w_{p+1})$.

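1. let us suppose that the number of licensing features of $G$ is $p$; then types of
the form $[\Gamma \vdash k(l)]_M$ or $[\Gamma \vdash h(l') \multimap k(l)]_M$ with $k$ in $\{d; e\}$ and $M$ in
$\{I; E\}$ are interpreted as the type $\gamma = (str^{p+1} \multimap str) \multimap str$. We furthermore
assume that the set of licensing features is $\{a_1; \ldots; a_p\}$ so that they are
implicitly ordered.

2. the types of the form $[\vdash \alpha]_E$, where $\alpha$ is the type of a move constant, are
interpreted as $\gamma \multimap \gamma$.

3. the types of the form $[\vdash \alpha]_E$, where $\alpha$ is the type of a merge constant, are
interpreted as $\gamma \multimap \gamma \multimap \gamma$.

4. then $\mathcal{J}((w, l)) = \lambda g. g\, w\, \varepsilon \ldots \varepsilon$, where $\varepsilon$ is repeated $p$ times.

5. $\mathcal{J}(merge[k(b\ -a_k l_1), k'(=b\, l_2)]) = \lambda D_1 D_2 g. D_1(\lambda s_1 x_1 \ldots x_p. D_2(\lambda s_2 y_1 \ldots y_p. g\, s_2\, (x_1 \bullet y_1) \ldots (x_{k-1} \bullet y_{k-1})\, s_1\, (x_{k+1} \bullet y_{k+1}) \ldots (x_p \bullet y_p)))$

6. $\mathcal{J}(merge[k(b), d(=b\, l_2)]) = \lambda D_1 D_2 g. D_1(\lambda s_1 x_1 \ldots x_p. D_2(\lambda s_2 y_1 \ldots y_p. g\, (s_1 \bullet s_2)\, (x_1 \bullet y_1) \ldots (x_p \bullet y_p)))$

7. $\mathcal{J}(merge[k(b), e(=b\, l_2)]) = \lambda D_1 D_2 g. D_1(\lambda s_1 x_1 \ldots x_p. D_2(\lambda s_2 y_1 \ldots y_p. g\, (s_2 \bullet s_1)\, (x_1 \bullet y_1) \ldots (x_p \bullet y_p)))$

8. $\mathcal{J}(move[h(-a_k\ -a_j l_1), d(+a_k l_2)]) = \lambda D g. D(\lambda s x_1 \ldots x_p. g\, s\, x_1 \ldots x_{k-1}\, \varepsilon \ldots x_{j-1}\, x_k \ldots x_p)$

9. $\mathcal{J}(move[h(-a_k), d(+a_k l_2)]) = \lambda D g. D(\lambda s x_1 \ldots x_p. g\, (x_k \bullet s)\, x_1 \ldots x_{k-1}\, \varepsilon \ldots x_p)$

10. $\mathcal{J}(E_1) = \lambda f x. f x$ and $\mathcal{J}(E_2) = \lambda f x. f x$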
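
The tuple-based interpretation can be illustrated with a small Python sketch in which a derivation is a function expecting a continuation `g` that receives the head string followed by one slot per licensing feature. The function names, the 0-based slot index `k`, the use of a space as separator and the example term are assumptions made for illustration; only items 4 to 7 and 9 are covered.

```python
def cat(*ws):
    """String concatenation that skips empty strings (epsilon)."""
    return " ".join(w for w in ws if w)

def lex(w, p):
    """Item 4: a lexical entry has an empty string in each of its p slots."""
    return lambda g: g(w, *[""] * p)

def merge_moving(k):
    """Item 5: the selected phrase s1 is stored in slot k for later movement."""
    def j(D1, D2):
        return lambda g: D1(lambda s1, *xs: D2(lambda s2, *ys: g(
            s2, *[s1 if i == k else cat(x, y)
                  for i, (x, y) in enumerate(zip(xs, ys))])))
    return j

def merge_derived(D1, D2):
    """Item 6: derived selector, the selected phrase goes to the left."""
    return lambda g: D1(lambda s1, *xs: D2(lambda s2, *ys: g(
        cat(s1, s2), *[cat(x, y) for x, y in zip(xs, ys)])))

def merge_lexical(D1, D2):
    """Item 7: lexical selector, the selected phrase goes to the right."""
    return lambda g: D1(lambda s1, *xs: D2(lambda s2, *ys: g(
        cat(s2, s1), *[cat(x, y) for x, y in zip(xs, ys)])))

def move_final(k):
    """Item 9: slot k is emptied and its content is put in front of the head."""
    def j(D):
        return lambda g: D(lambda s, *xs: g(
            cat(xs[k], s), *["" if i == k else x for i, x in enumerate(xs)]))
    return j

def realise(D):
    """Read off the head string of a complete derivation."""
    return D(lambda s, *xs: s)

# Example with a single licensing feature (p = 1, slot 0 stands for "case").
P = 1
vp = merge_lexical(lex("Nahuatl", P), lex("speak", P))
vp = merge_moving(0)(lex("Maria", P), vp)
tp = merge_lexical(vp, lex("will", P))
sentence = realise(move_final(0)(tp))
```

Because the shortest move constraint guarantees that each slot holds at most one moving piece, no names or list searches are needed here, in contrast with the context-based interpretation.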

Altogether, $SMC(\Sigma_G)$ and $\mathcal{J}$ define an affine second order string Abstract
Categorial Grammar in the sense of [15], which also shows that the language
of such a grammar is the language of a linear second order string Abstract
Categorial Grammar. But it is shown in [16] that such grammars can only
define MCFLs.

This construction of an MCFL from an MG with the SMC is not essentially 
different from the one given in ca. but the transformation we propose
preserves in an obvious way (thanks to the homomorphism $\mathcal{D}$) the structure of the
derivation, so that it preserves the interface between syntax and semantics.
Furthermore, the use of tuples can also replace the complex semantic contexts that
would be necessary without the SMC, so that the semantics would become very similar to
Montague semantics.

3.2 Saving Computation 

One of the benefits of representing derivations by using the signature $\Sigma_G$ is
that it makes it possible to separate the syntactic interpretation of the derivations from
their semantic interpretation. Indeed, as in [18], the semantic interpretation of
minimalist grammars has to be done in parallel to its syntactic interpretation.
Nevertheless, this parallel computation shows that if we want to give a semantic
interpretation of the derivations of minimalist grammars, then we will need to
implement a context that is at least as complicated as the one we have defined
for the syntactic interpretation. In order to avoid performing similar computations
for both the syntactic and the semantic interpretations, we
can compute an intermediate structure in which all the computations that are
necessary for both the syntactic interpretation and the semantic one are already
performed.
This intermediate structure is built on a rather simple signature $Int_G$ whose
types are the features of the minimalist grammar $G$ plus a fresh type
$\mathbf{d}$ for complete derivations, $B \cup F \cup \{\mathbf{d}\}$. The constants are defined as
follows:

1. $(w, e_1 b_1 \ldots e_n b_n a\ -f_1 \ldots -f_p) : (a \multimap f_1 \multimap \cdots \multimap f_{p-1} \multimap (b_1 \multimap \cdots \multimap b_n \multimap f_p) \multimap \mathbf{d}) \multimap \mathbf{d}$
for every lexical entry $(w, e_1 b_1 \ldots e_n b_n a\ -f_1 \ldots -f_p)$ in $C$,
where $p > 0$,

2. $(w, e_1 b_1 \ldots e_n b_n a) : ((b_1 \multimap \cdots \multimap b_n \multimap a) \multimap \mathbf{d}) \multimap \mathbf{d}$ for every lexical entry
$(w, e_1 b_1 \ldots e_n b_n a)$ in $C$,

3. $r : c \multimap \mathbf{d}$

In this signature, derivations are represented by closed terms of the form:

$e_1(\lambda y^1_1 \ldots y^1_{p_1} x_1.\ \ldots\ e_n(\lambda y^n_1 \ldots y^n_{p_n} x_n.\ r\,t) \ldots)$

where the $e_i$ are constants and where $t$ is a term built only with the variables $x_i$
and $y^i_j$. The $x_i$ represent the last place to which $e_i$ is moved, where it is glued
with the components that have licensed its features, whereas the $y^i_j$ represent the
traces of $e_i$ in the derivation after it has been moved several times. Of course, if
we take every closed term of this form, many of them will not correspond to an
actual minimalist derivation. Moreover, this presentation has the shortcoming
that a single derivation may have several representations. Indeed, without
any further constraint, if

$e_1(\lambda x_1 y^1_1 \ldots y^1_{p_1}.\ \ldots\ e_n(\lambda x_n y^n_1 \ldots y^n_{p_n}.\ r\,t) \ldots)$

represents a minimalist derivation, then so does

$e_{\tau(1)}(\lambda x_{\tau(1)} y^{\tau(1)}_1 \ldots y^{\tau(1)}_{p_{\tau(1)}}.\ \ldots\ e_{\tau(n)}(\lambda x_{\tau(n)} y^{\tau(n)}_1 \ldots y^{\tau(n)}_{p_{\tau(n)}}.\ r\,t) \ldots)$

for any permutation $\tau$ of $[1; n]$. In order to eliminate these spurious ambiguities,
we can constrain the $e_i$ to appear in the same order as in the surface
interpretation of the derivation.

Example 5. A representation of the derivation of Example 1 as an intermediate
structure can be given using the constants:

1. $(Maria,\ d\ -case) : (d \multimap case \multimap \mathbf{d}) \multimap \mathbf{d}$

2. $(will,\ =v\ +case\ c) : ((v \multimap case \multimap c) \multimap \mathbf{d}) \multimap \mathbf{d}$

3. $(speak,\ =d\ =d\ v) : ((d \multimap d \multimap v) \multimap \mathbf{d}) \multimap \mathbf{d}$

4. $(Nahuatl,\ d) : (d \multimap \mathbf{d}) \multimap \mathbf{d}$

With those constants the derivation is represented by the following $\lambda$-terms
(only the first one respects the constraint that eliminates spurious ambiguities):

$(Maria,\ d\ -case)(\lambda m_d m_{case}.\ (will,\ =v\ +case\ c)(\lambda w.\ (speak,\ =d\ =d\ v)(\lambda s.\ (Nahuatl,\ d)(\lambda n_d.\ r\ (w\ (s\ n_d\ m_d)\ m_{case})))))$

$(Nahuatl,\ d)(\lambda n_d.\ (speak,\ =d\ =d\ v)(\lambda s.\ (Maria,\ d\ -case)(\lambda m_d m_{case}.\ (will,\ =v\ +case\ c)(\lambda w.\ r\ (w\ (s\ n_d\ m_d)\ m_{case})))))$



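For Example 2, the derivations can be represented using constants typed as follows:

1. $(\alpha,\ =a_1\ =a_2\ b) : ((a_1 \multimap a_2 \multimap b) \multimap \mathbf{d}) \multimap \mathbf{d}$

2. $(\beta,\ =b\ +c\ +c\ +c\ b) : ((b \multimap c \multimap c \multimap c \multimap b) \multimap \mathbf{d}) \multimap \mathbf{d}$

3. $(\delta,\ d) : (d \multimap \mathbf{d}) \multimap \mathbf{d}$

4. $(\gamma_1,\ =d\ a_1\ -c\ -c) : (a_1 \multimap c \multimap (d \multimap c) \multimap \mathbf{d}) \multimap \mathbf{d}$

5. $(\gamma_2,\ a_2\ -c) : (a_2 \multimap c \multimap \mathbf{d}) \multimap \mathbf{d}$

With these constants, including $r : b \multimap \mathbf{d}$, we can represent the three derivations
given in Example 3 with the following $\lambda$-terms (here we only give the terms
obeying the constraint that avoids spurious ambiguities):

$(\gamma_2,\ a_2\ -c)(\lambda x_2 y_2.\ (\gamma_1,\ =d\ a_1\ -c\ -c)(\lambda x_1 y_1 z_1.\ (\delta,\ d)(\lambda v.\ (\beta,\ =b\ +c\ +c\ +c\ b)(\lambda f.\ (\alpha,\ =a_1\ =a_2\ b)(\lambda g.\ r\ (f\ (g\ x_1\ x_2)\ y_1\ (z_1\ v)\ y_2))))))$

$(\gamma_1,\ =d\ a_1\ -c\ -c)(\lambda x_1 y_1 z_1.\ (\delta,\ d)(\lambda v.\ (\gamma_2,\ a_2\ -c)(\lambda x_2 y_2.\ (\beta,\ =b\ +c\ +c\ +c\ b)(\lambda f.\ (\alpha,\ =a_1\ =a_2\ b)(\lambda g.\ r\ (f\ (g\ x_1\ x_2)\ y_1\ y_2\ (z_1\ v)))))))$

$(\gamma_1,\ =d\ a_1\ -c\ -c)(\lambda x_1 y_1 z_1.\ (\delta,\ d)(\lambda v.\ (\gamma_2,\ a_2\ -c)(\lambda x_2 y_2.\ (\beta,\ =b\ +c\ +c\ +c\ b)(\lambda f.\ (\alpha,\ =a_1\ =a_2\ b)(\lambda g.\ r\ (f\ (g\ x_1\ x_2)\ y_2\ y_1\ (z_1\ v)))))))$
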

Interestingly, the variable $v$ that represents the position of the lexical entry $(\delta, d)$
is the argument of the variable $z_1$, which is the variable that represents the last
position the moving piece built around $(\gamma_1,\ =d\ a_1\ -c\ -c)$ occupies after movement.
This has to be contrasted with the terms built in Example 0| where the lexical 
entry $(\delta, d)$ is placed as an argument of the variable $y_1$ that represents the first
position occupied by $(\gamma_1,\ =d\ a_1\ -c\ -c)$. So with this representation of derivations
every movement has already been performed.

Note that the order in which $y_1$, $z_1$ and $y_2$ appear as arguments of $f$
accounts for the order in which the licensing features $-c$ of $(\gamma_1,\ a_1\ -c\ -c)$ and
$(\gamma_2,\ a_2\ -c)$ are checked against the $+c$ features of $(\beta,\ =b\ +c\ +c\ +c\ b)$. In particular,
this enforces a particular order between $y_1$ and $z_1$, which represent the two places
to which $(\gamma_1,\ a_1\ -c\ -c)$ moves, $y_1$ being the first and $z_1$ being the second. With
the choices we made about representing movement order, the following $\lambda$-term,
even though it is well-typed, does not represent a derivation, since it would mean
that the second movement of $(\gamma_1,\ a_1\ -c\ -c)$ has been performed before its first:

$(\gamma_1,\ =d\ a_1\ -c\ -c)(\lambda x_1 y_1 z_1.\ (\delta,\ d)(\lambda v.\ (\gamma_2,\ a_2\ -c)(\lambda x_2 y_2.\ (\beta,\ =b\ +c\ +c\ +c\ b)(\lambda f.\ (\alpha,\ =a_1\ =a_2\ b)(\lambda g.\ r\ (f\ (g\ x_1\ x_2)\ y_2\ (z_1\ v)\ y_1))))))$

There are several things to remark about this representation of minimalist
derivations. First of all, contrary to our proposal in MELL, the features are
treated explicitly as resources by the logic since they are represented as atomic
formulae. The positive and negative versions of a feature are represented by the very same
atomic type; retrieving whether an occurrence is negative or positive amounts
to finding its polarity in the formula. As polarity in linear logic corresponds to
the fact that a formula provides or requires some other formula as a resource,
the feature checking system of minimalist grammars is adequately modelled that
way. This fact has been observed in previous works on logical accounts of
minimalist grammars, where people have tried to use polarities to elegantly render
the feature checking system of minimalist grammars. As we have shown in the
example, multiplicative linear logic does not seem to give enough control on
the structure of the proofs to define derivations as being all the closed
terms of a particular type, which explains why those attempts have used
logics similar to the Lambek calculus. But, since this line of research has not
succeeded in defining minimalist derivations uniquely by logical means, it seems to
be a difficult problem to define in logical terms exactly the terms that represent
minimalist derivations in a signature similar to $Int_G$.

Another nice property of the representation of derivations in $Int_G$ is that, as
we wished, the homomorphism that interprets those terms as sentences is quite
simple. Indeed, $(w, e_1 b_1 \ldots e_n b_n a\ -f_1 \ldots -f_p)$ would be interpreted as:

$\lambda g.\ g\ \varepsilon \ldots \varepsilon\ (\lambda z_1 \ldots z_n.\ z_n \bullet \cdots \bullet z_2 \bullet w \bullet z_1)$, where $\varepsilon$ is repeated $p$ times.

The semantic interpretation would also be given by such simple
homomorphisms. Thus being able to compute these intermediate representations factors
out the computation that is common to both the syntactic and semantic
interpretations of the derivations.
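
As an illustration of how simple this homomorphism is, the following Python sketch interprets the two terms of Example 5. The helper `entry` follows the interpretation given above; treating entries without licensee features as the case $p = 0$ of the same formula, interpreting $r$ as the identity, and joining words with spaces are assumptions made for this sketch.

```python
def cat(*ws):
    """String concatenation that skips empty strings (epsilon)."""
    return " ".join(w for w in ws if w)

def entry(word, n, p):
    """Interpretation of a lexical entry with n selecting/licensing features
    (the e_i b_i) and p licensee features (the -f_i):
    lambda g. g eps ... eps (lambda z1 ... zn. zn . ... . z2 . w . z1)."""
    def head(*zs):
        return cat(*reversed(zs[1:]), word, *zs[:1])
    last = head if n > 0 else head()   # with n = 0 the last argument is just w
    return lambda g: g(*[""] * p, last)

r = lambda s: s                         # assumption: r is the identity on strings

maria = entry("Maria", 0, 1)            # (Maria, d -case)
will = entry("will", 2, 0)              # (will, =v +case c)
speak = entry("speak", 2, 0)            # (speak, =d =d v)
nahuatl = entry("Nahuatl", 0, 0)        # (Nahuatl, d)

# The two lambda-terms of Example 5; they differ only in binder order.
t1 = maria(lambda m_d, m_case:
           will(lambda w:
                speak(lambda s:
                      nahuatl(lambda n_d: r(w(s(n_d, m_d), m_case))))))
t2 = nahuatl(lambda n_d:
             speak(lambda s:
                   maria(lambda m_d, m_case:
                         will(lambda w: r(w(s(n_d, m_d), m_case))))))
```

Both terms evaluate to the same string, which reflects the fact that the order of the binders only matters for eliminating spurious ambiguities.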

Since it does not yet seem possible to define by logical means the elements
built on $Int_G$ that represent minimalist derivations, we must use
language-theoretic means and define these elements with a homomorphism from
$\Sigma_G$ to $Int_G$. The definition of the homomorphism computing this intermediate
representation from the derivation requires a technique very similar to the one
used for the definition of $\mathcal{I}$, which transforms derivations into sentences.
The actual implementation is a little more complex, because we need to handle
lists of traces, but it can be represented using the same computational power as
for $\mathcal{I}$ (i.e. Gödel's system T). Due to space limitations, we cannot give here the
technical details of the transformation.

4 The Point of View of Monadic Second Order Logic 

We have seen that linear $\lambda$-terms adequately represent minimalist derivations.
But we have also seen that the interpretation of those terms is not trivial. This
leads us to the conclusion that terms are not an adequate representation of proofs
when it comes to interpreting them as strings. We therefore switch to a proof-net
representation of those proofs. We could use the proof-nets that represent the $\lambda$-terms
we have defined in the previous section. We will however not do so and use the
proof-nets introduced by Stabler in m- Stabler’s proof-nets are tailor-made for 
representing minimalist derivations and are therefore more concise than the ones
we would obtain by a direct representation of the proofs built on $\Sigma_G$. We then
give the syntactic interpretation of those proof-nets with an MSO-transduction
which is fairly simple when compared to the previous homomorphisms. This
shows that graph transformations are more natural than homomorphisms when
dealing with the interpretation of minimalist derivations. This comes from the fact
that dealing with graphs has the advantage of avoiding the top-down rigidity of
terms. As there is no directionality, we can easily find the right place to
put things. This suggests that MSO-transductions of proof-nets should also be
used to deal with the semantic interpretation of minimalist derivations.
We start by defining Stabler's proof-nets as relational structures. Given
a ranked alphabet $\Omega$, an $\Omega$-relational structure is a tuple $(S, (R_a)_{a \in \Omega})$ where $S$
is a finite set, the carrier of the $\Omega$-relational structure, and $R_a$ is a subset of $S^n$
($n$ being the arity of $a$). Thus we define Stabler's proof-nets as $\Omega(G)$-relational
structures where, given a minimalist grammar $G$, $\Omega(G)$ is the ranked alphabet
whose constants are the lexical entries of $G$ and the arity of such a constant $(w, l)$
is the length of $l$. Given an $\Omega(G)$-relational structure $\mathcal{R} = (S, (R_{(w,l)})_{(w,l) \in \Omega(G)})$,
a tuple $(x_1, \ldots, x_{|l|})$ that is in $R_{(w,l)}$ is called a $(w, l)$-link or a link. We say
that $x_i$ belongs to that link; the type of $x_i$ in that link is the $i$-th feature of
$l$. Of course, not every possible $\Omega(G)$-relational structure is going to represent
a derivation of $G$, and as usual we need a correctness criterion to discriminate
structures representing an actual derivation from those that do not. If the tuple
$(x_1, \ldots, x_{|l_1|}, z_0, z_1, \ldots, z_{|l_2|})$ is a $(w, l_1 a l_2)$-link then $x_i$ is its $i$-th argument and
$z_j$ is its $j$-th trace, $z_0$ is its initial trace and $z_{|l_2|}$ is its actual trace. We say that a
link $l_1$ dominates a link $l_2$ if the initial trace of $l_2$ is an argument of $l_1$; we then
write $l_2 \lhd l_1$ ($\lhd^*$ is the reflexive transitive closure of $\lhd$). Then the correctness
criterion can be stated as:

1. there is a unique element $r$, the conclusion of the proof-net, which belongs
to exactly one link; its type is $c$,

2. every element $y$ different from $r$ belongs to exactly two links $p_y$ and $n_y$; $y$
is a trace of $p_y$ and an argument of $n_y$, and if the type of $y$ in $p_y$ is $a$ (resp.
$-f$) then its type in $n_y$ is $=a$ (resp. $+f$),

3. on links the relation $\lhd$ forms a tree whose root is $p_r$,

4. if $y_1$ and $y_2$ are respectively the $j_1$-th and the $j_2$-th traces of a link $l$ and if
$j_1 < j_2$ then $n_{y_1} \lhd^* n_{y_2}$, and in case $n_{y_1} = n_{y_2}$, $y_1$ being its $i_1$-th argument and
$y_2$ being its $i_2$-th argument, we have $i_1 < i_2$.

The relational structures that satisfy all those properties are called proof-nets.
It is easy to prove a property similar to sequentialization so as to show the
correspondence between those proof-nets and the closed terms of $\Sigma_G$ of type
$d(c)$. We do not give the formal proof here as it would not bring anything of
interest to our exposition.
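
The first three conditions of the correctness criterion can be checked mechanically. The Python sketch below does so for a candidate proof-net of "Maria will speak Nahuatl". The encoding of links as triples, the function names and the numbering of the vertices are assumptions made for illustration; the fourth condition (the ordering of traces) is deliberately left out to keep the sketch short.

```python
# A link is (word, features, vertices): arguments come first, then traces.
LINKS = [
    ("will", ["=v", "+case", "c"], (4, 3, 5)),
    ("speak", ["=d", "=d", "v"], (1, 2, 4)),
    ("Maria", ["d", "-case"], (2, 3)),
    ("Nahuatl", ["d"], (1,)),
]

def n_args(features):
    """Number of arguments of a link: its selecting and licensing features."""
    return sum(1 for f in features if f[0] in "=+")

def check_conditions_1_to_3(links):
    vertices = {v for _, _, vs in links for v in vs}
    # occ[v]: the (link index, position) pairs at which v occurs
    occ = {v: [(i, vs.index(v)) for i, (_, _, vs) in enumerate(links) if v in vs]
           for v in vertices}
    # Condition 1: a unique vertex r lies in exactly one link, with type c.
    once = [v for v in vertices if len(occ[v]) == 1]
    if len(once) != 1:
        return False
    r = once[0]
    root, pos = occ[r][0]
    if links[root][1][pos] != "c":
        return False
    # Condition 2: any other vertex is a trace of one link and an argument
    # of another one, with matching types (a / =a, -f / +f).
    for v in vertices - {r}:
        if len(occ[v]) != 2:
            return False
        tr = [(i, p) for i, p in occ[v] if p >= n_args(links[i][1])]
        ar = [(i, p) for i, p in occ[v] if p < n_args(links[i][1])]
        if len(tr) != 1 or len(ar) != 1:
            return False
        t = links[tr[0][0]][1][tr[0][1]]
        a = links[ar[0][0]][1][ar[0][1]]
        if a != ("+" + t[1:] if t.startswith("-") else "=" + t):
            return False
    # Condition 3: domination forms a tree rooted at the link of r.
    def parent(i):
        _, feats, vs = links[i]
        z0 = vs[n_args(feats)]                       # initial trace
        ps = [j for j, (_, f, w) in enumerate(links) if z0 in w[:n_args(f)]]
        return ps[0] if ps else None
    for i in range(len(links)):
        seen, j = set(), i
        while j is not None and j != root:
            if j in seen:
                return False
            seen.add(j)
            j = parent(j)
        if j != root:
            return False
    return True
```

Removing the Nahuatl link, for example, leaves vertex 1 in a single link, so the structure no longer has a unique conclusion and the check fails.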

Intuitively, the first condition expresses that a proof-net forms a derivation of
the correct type. The second condition expresses the fact that every feature
has to be licensed in proof-nets. The third condition enforces the hierarchical
construction of the derivation. Finally, the last condition makes the movements
of a moving piece be performed in the right order, so that the licensing features
are licensed in the linear order in which they appear in the list of features.

Example 6. We will give a graphical representation of proof-nets. The links of a
lexical entry like $(\gamma_1,\ =d\ a_1\ -c\ -c)$ will be represented as a hyperedge like:

[Figure: a hyperedge labelled $\gamma_1$ whose four tentacles, of types $=d$, $a_1$, $-c$ and $-c$, are attached to the vertices 1, 2, 3 and 4 respectively.]

The vertices (the elements of the carrier of the relational structure) are
represented with black dots. As in this example, where they are labelled 1, 2, 3 and
4, we will use labels in order to designate certain elements of the carrier. Here
the $(\gamma_1,\ =d\ a_1\ -c\ -c)$-link that is graphically represented is $(1, 2, 3, 4)$. This link
has one argument, 1, and three traces, 2, 3 and 4, the actual trace of the link
being 4; 1, 2, 3 and 4 respectively have type $=d$, $a_1$, $-c$ and $-c$ in this link. This
information is graphically represented in the intuitive way by the origins of the
tentacles on the hyperedge.

The derivation of Example 1 is graphically represented by the following
proof-net:



It is easy to check that this proof-structure satisfies the first two requirements
for being a proof-net. The third condition is fulfilled since
the sole $(will,\ =v\ +case\ c)$-link dominates the sole $(speak,\ =d\ =d\ v)$-link,
which dominates the $(Nahuatl,\ d)$-link and the $(Maria,\ d\ -case)$-link, and there
is no other domination relation. Finally, the fourth condition has to be checked
only for the vertices 2 and 3, which are traces of the $(Maria,\ d\ -case)$-link. We can
see that 2 is an argument of the $(speak,\ =d\ =d\ v)$-link and that 3 is an argument
of the $(will,\ =v\ +case\ c)$-link, and that their domination relation agrees with the
fourth condition.

The derivations of Example 2 are represented with the following proof-nets:
In order to give the syntactic interpretation of those proof-nets, we are
going to use the notion of MSO-transduction (see [30]). An MSO-transduction
transforms an $\Omega$-relational structure into a $\Delta$-relational structure as follows:

1. first, a finite number of copies of the carrier of the initial relational structure
is made,

2. for each copy, an MSO-formula specifies which elements of the carrier are
kept,

3. then, for each relation in $\Delta$, some MSO-formulae specify which tuples of the
remaining points satisfy this relation.

For the sake of simplicity, the MSO-transduction that gives the syntactic
interpretation of proof-nets is specified as the composition of two MSO-transductions.
A first transduction transforms the proof-nets into a string represented as a
relational structure. This string may contain some occurrences of $\varepsilon$, the empty
string. The second transduction simply removes these occurrences of $\varepsilon$. This
second transduction is quite simple and we will not enter into the details of its
implementation.

Thus, the first transduction transforms $\Omega(G)$-relational structures $\mathcal{R}$ that are
proof-nets into $W(G)$-relational structures $\mathcal{W}$, where $W(G)$ is the set of binary
relations $W \cup \{\varepsilon\}$. We first take two copies of the elements of $\mathcal{R}$ and we keep
every vertex of each copy in the new structure. We write $fst(x)$ and $snd(x)$ to
denote respectively the first and second copy of $x$ in the new structure. In the
new structure, we add a relation $\varepsilon(fst(x), snd(x))$ if $x$ is not the actual trace of
$p_x$, and $w(fst(x), snd(x))$ if $x$ is the actual trace of $p_x$, which is a $(w, l)$-link.
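
The first step of this transduction is easy to sketch in Python. Links are encoded as triples (word, features, vertices) with arguments listed before traces; this encoding, the vertex numbering of the example and the representation of a binary relation as a labelled edge are assumptions made for illustration.

```python
# Assumed proof-net for "Maria will speak Nahuatl".
LINKS = [
    ("will", ["=v", "+case", "c"], (4, 3, 5)),
    ("speak", ["=d", "=d", "v"], (1, 2, 4)),
    ("Maria", ["d", "-case"], (2, 3)),
    ("Nahuatl", ["d"], (1,)),
]

def n_args(features):
    """Number of arguments of a link: its selecting and licensing features."""
    return sum(1 for f in features if f[0] in "=+")

def first_step(links):
    """For every vertex x, add an edge from fst(x) to snd(x) labelled with the
    word w of p_x if x is the actual trace of p_x, and with epsilon otherwise.
    In a proof-net each vertex is a trace of exactly one link, so iterating
    over the traces of every link visits each vertex once."""
    edges = []
    for word, feats, vs in links:
        for x in vs[n_args(feats):]:
            label = word if x == vs[-1] else "ε"   # vs[-1] is the actual trace
            edges.append((("fst", x), label, ("snd", x)))
    return edges

EDGES = first_step(LINKS)
```

In this example only vertex 2, the initial trace of the Maria link, receives an $\varepsilon$ edge; the word "Maria" is attached to vertex 3, the position it occupies after movement.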

So as to explain the transduction, we will use the following derivation as a 
running example. 



The first step of the MSO-transduction that we have just described transforms
this derivation structure into the following structure. We keep the same labels
for the vertices of the first copy of the carrier and we use primed labels for the
second copy. Furthermore, we have put an arrow on one of the tentacles to
indicate the right-hand side of the represented letter (the second argument in the represented
relation):






We now have all the ingredients that are necessary to construct the string we
want. It just remains to concatenate them suitably. This concatenation will be
performed simply by putting $\varepsilon$ relations where needed. If we want to
concatenate the words into which $x$ and $y$ are transformed in $\mathcal{W}$, it suffices to put a
relation $\varepsilon(snd(x), fst(y))$ in $\mathcal{W}$. This concatenation will be possible only if we can
express in MSO the relation $x < y$, which is the linear order of the chain we want
to build. If we look at the level of a $(w, l_1 a l_2)$-link $(x_1, \ldots, x_{|l_1|}, z_0, z_1, \ldots, z_{|l_2|})$,
in order to build the string around $z_{|l_2|}$, the actual trace, we need to build the
string $s_{|l_1|} \ldots s_2\, w\, s_1$, where $s_i$ is the string that is constructed around $x_i$. To do so,
we need to be able to find the elements of $\mathcal{R}$ that start or end the string that is
built around an element $x$. To achieve this, we introduce two binary predicates
$follow(x, y)$ and $precede(x, y)$:


1. $follow(x, y)$ holds if and only if $x$ is the actual trace of the link
$p_x$, $p_x$ has at least one argument, and $y$ is the first argument of $p_x$,

2. $precede(x, y)$ holds if and only if $x$ is the actual trace of the link
$p_x$, $p_x$ has at least two arguments, and $y$ is the last argument of $p_x$.


The relations $follow(x, y)$ and $precede(x, y)$ hold in a proof-net when $y$
is the element in $p_x$ whose start or end is also the start or the end of the
string built around $x$ when $x$ is the actual trace of some link. In the
example, if we represent pictorially the relations follow and precede (the
arrow is pointing at the second element of the predicate) then we would obtain:



It is obvious that $follow(x, y)$ and $precede(x, y)$ are MSO-definable
predicates. We note $follow^*(x, y)$ and $precede^*(x, y)$ the respective
reflexive and transitive closures of $follow(x, y)$ and $precede(x, y)$. These
relations are also MSO-definable since transitive closures of MSO-definable
relations are also MSO-definable. We then define the relations $start(x, y)$
and $end(x, y)$ as:

$$start(x, y) = precede^*(x, y) \wedge \forall z.\ precede^*(y, z) \Rightarrow y = z$$

$$end(x, y) = follow^*(x, y) \wedge \forall z.\ follow^*(y, z) \Rightarrow y = z$$

On a proof-net the relations start and end define functions, i.e. for every $x$
there is exactly one $y_s$ and exactly one $y_e$ such that $start(x, y_s)$ and
$end(x, y_e)$. According to the definition, we obtain the following table to
describe these relations:

    x    y_s  y_e
    1    1    1
    2    2    2
    3    3    3
    4    3    2
    5    5    5
    6    6    6
    7    7    1
    8    7    2


We are now in a position to define the relation $x < y$, which says that the
trace introduced by $x$ appears just before the trace introduced by $y$.

If $(x_1, \ldots, x_n, z_0, \ldots, z_p)$ is a
$(w, {=}b_1 \ldots \epsilon_n b_n, l_0, \ldots, l_p)$-link then we have that
$x < y$ if and only if one of the following holds:

1. $n > 0$, $x = z_p$ and $start(x_1, y)$,

2. $n > 1$, $end(x_2, x)$ and $y = z_p$,

3. for some $1 < i < n$, $end(x_{i+1}, x)$ and $start(x_i, y)$.

This is enough to define the precedence relation. It is provable that the
transitive closure of $<$ is a total order when $\mathcal{R}$ is a proof-net.
On our example the relation $<$ would order the set $\{1, \ldots, 8\}$ as
follows:

$$7 < 1 < 6 < 5 < 8 < 3 < 4 < 2$$

Indeed, if we look at 8, the conclusion of the proof-net, it is the actual
trace of the $(\beta, {=}b\ {+}c\ {+}c\ {+}c\ b)$-link of the proof-net, and
the first, second, third and fourth arguments of the link are, respectively, 4,
5, 6 and 7. As we have $start(4, 3)$, we have $8 < 3$; as we have $end(7, 1)$
and $start(6, 6)$, we have $1 < 6$; as we have $end(6, 6)$ and $start(5, 5)$,
we have $6 < 5$; and as we have $end(5, 5)$, we have $5 < 8$. Similarly, by
looking at every vertex that is the actual trace of some link we can complete
the relation $<$ as above. Since $follow^*$ and $precede^*$ are MSO-definable
relations, $start$ and $end$ are MSO-definable, and thus $<$ is an
MSO-definable relation.
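
As an illustration only, the construction of start, end and the order $<$ can
be replayed in a few lines of Python. The dictionary LINKS is a hypothetical
encoding of the running example, reconstructed from the table of start/end
values rather than from the original figure, and the helper names are ours.
The first condition is read as placing the actual trace immediately before the
start of the string built around $x_1$, which is the reading used in the worked
example ($8 < 3$).

```python
# Hypothetical encoding of the running example (reconstructed from the
# start/end table, not taken from the original figure): each actual trace
# is mapped to the ordered list of arguments x_1, ..., x_n of its link.
LINKS = {8: [4, 5, 6, 7], 4: [2, 3], 7: [1], 1: [], 6: []}


def follow(x):
    """First argument of the link whose actual trace is x, or None."""
    args = LINKS.get(x, [])
    return args[0] if len(args) >= 1 else None


def precede(x):
    """Last argument of that link, defined only with at least two arguments."""
    args = LINKS.get(x, [])
    return args[-1] if len(args) >= 2 else None


def extremity(step, x):
    """Iterate `step` from x until it is undefined (maximal element of step*)."""
    while step(x) is not None:
        x = step(x)
    return x


def start(x):
    return extremity(precede, x)


def end(x):
    return extremity(follow, x)


def successor_pairs():
    """Pairs (x, y) with x < y, one link at a time; z is the actual trace."""
    pairs = set()
    for z, args in LINKS.items():
        n = len(args)
        if n > 0:                  # w comes right before the string of x_1
            pairs.add((z, start(args[0])))
        if n > 1:                  # the string of x_2 ends right before w
            pairs.add((end(args[1]), z))
        for i in range(2, n):      # 1 < i < n (1-based): s_{i+1} before s_i
            pairs.add((end(args[i]), start(args[i - 1])))
    return pairs


def linear_order(vertices):
    """Chain the successor pairs, starting from the vertex with no predecessor."""
    nxt = dict(successor_pairs())
    (first,) = set(vertices) - set(nxt.values())
    order = [first]
    while order[-1] in nxt:
        order.append(nxt[order[-1]])
    return order


print(linear_order(range(1, 9)))  # [7, 1, 6, 5, 8, 3, 4, 2]
```

Under these assumptions the sketch reproduces both the table and the order
given in the text.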

As mentioned previously, the relation $<$ describes the way the words
represented by binary relations must be concatenated in order to produce the
resulting string. As we describe the transduction as the composition of two
transductions, the first one building the resulting string interspersed with
empty strings and the second one deleting the empty strings, we represent the
concatenation of two nodes $x$ and $y$ by putting an empty string between the
second copy of $x$ and the first copy of $y$. So if we add this concatenation
requirement to the definition of $\varepsilon$, we have that
$\varepsilon(fst(x), snd(x))$ holds if and only if $x$ is not the actual trace
of any link, and $\varepsilon(snd(x), fst(y))$ holds if and only if $x < y$. In
our example, concatenating 5 and 8 amounts to adding the pair $(5', 8)$ to the
relation $\varepsilon$. So after the MSO-transduction we have defined is
applied to the derivation we have taken as an example, we obtain the following
relational structure that represents the string
$\gamma_1 \varepsilon \delta \varepsilon \gamma_2 \varepsilon \varepsilon \varepsilon \beta \varepsilon \varepsilon \varepsilon \alpha \varepsilon \varepsilon$:



Then a simple string homomorphism can suppress the occurrences of
$\varepsilon$. In the example we would then obtain the relational structure
representing, as expected, the string $\gamma_1 \delta \gamma_2 \beta \alpha$.
Such string to string transformations can be defined with MSO-transductions. As
MSO-transductions are closed under composition, we have shown that the
interpretation of proof-nets into strings can be computed using
MSO-transductions.
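
The two-step interpretation can be mimicked on the running example as follows.
This is only a sketch on plain Python lists, not an MSO-transduction: ORDER is
the order given in the text, and LETTER (which vertex carries which letter) is
an assumption inferred from the output string quoted above.

```python
EPS = "ε"
ORDER = [7, 1, 6, 5, 8, 3, 4, 2]                        # the order < of the example
LETTER = {7: "γ1", 1: "δ", 6: "γ2", 8: "β", 4: "α"}     # assumed labelling


def first_transduction(order, letter):
    """Build the string interspersed with empty strings."""
    out = []
    for i, x in enumerate(order):
        if i > 0:
            out.append(EPS)             # edge ε(snd(previous), fst(x))
        out.append(letter.get(x, EPS))  # edge w(fst(x), snd(x)) or ε(fst(x), snd(x))
    return out


def erase_epsilon(symbols):
    """Second step: the homomorphism deleting every occurrence of ε."""
    return [s for s in symbols if s != EPS]


padded = first_transduction(ORDER, LETTER)
print(" ".join(padded))                 # γ1 ε δ ε γ2 ε ε ε β ε ε ε α ε ε
print(" ".join(erase_epsilon(padded)))  # γ1 δ γ2 β α
```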

Now if we are concerned with the shortest move constraint, we first need to
remark that proof-nets are MSO-definable in $R(G)$-relational structures, and
then we can easily see that the shortest move constraint gives proof-nets with
a bounded tree-width. These proof-nets can therefore be represented as the
language of a hyperedge replacement grammar (an HR-language). As HR-languages
are closed under MSO-transduction, we get that the languages of MGs with
shortest move are HR-languages. But it is known that the string languages of
HR grammars coincide with MCFLs.

5 Conclusion 

This work mainly aims at clarifying the status of derivations in minimalist
grammars without the shortest move constraint. It also tries to promote a certain
attitude towards formalisms describing natural languages. This attitude consists 
in trying to identify and study the abstract syntax of the formalisms. Abstract 
syntax plays a central role in formalisation because it is the particular place 
where the connection between syntax and semantics can be made. It is also at 
the level of abstract syntax that linguistic ideas like move and merge have the 
greatest influence. We have emphasized this role from a mathematical point of 
view by showing that these two operations dramatically reduce the complexity 
of generating syntactic structure by allowing a rather powerful polymorphism 
and also by taming variable binding using only third order types. 

From the mathematical side, this careful study of the derivations in
connection with some simple ideas coming from formal language theory has led us
to several new results. First, we have shown that the membership problem for
MGs is as difficult as the problem of proof-search in MELL. This result shows
that it is far from obvious whether the membership problem for MGs is
decidable. Second, we have obtained the unintuitive result that the languages
of MGs may not be semi-linear, contradicting an earlier conjecture. Finally, we
have obtained a rather interesting and new logical characterization of these
derivations as closed linear λ-terms of a certain type. This characterization
can be said to be logical since, with the Curry-Howard isomorphism, closed
linear λ-terms correspond to proofs in implicative linear logic with proper
axioms (they play the role of the exponentials) and since we do not appeal to
extra constraints in order to rule out terms that would not represent
derivations. Furthermore, this way of representing derivations is made even
more natural by the aforementioned Turing-equivalence of the membership problem
for MGs and proof-search in MELL.

From a linguistic point of view, this careful study also has some outcomes.
Indeed, the representation of the derivations of MGs we propose is unambiguous
in the sense that, contrary to Stabler’s proposal, each derivation only
represents the syntactic analysis of one sentence. Furthermore, since the
ambiguity of Stabler’s proposal makes sentences that have different meanings
have the same derivation, the disambiguation we propose should allow a simpler
interface between syntax and semantics. Our proposal also makes it clear that
movement and traces are adequately rendered by variable binding. It also gives
a methodological way of extending MGs. Indeed, the linear λ-calculus and typing
theory offer a good framework for devising sensible improvements of MGs.

Nevertheless, our proposal has several defects. First of all, contrary to what
would be expected, the feature checking system of MGs is not rendered by using
the resource sensitivity of linear logic; it is rather modeled by the
particular management of types that we use. Moreover, the linearisation of
those structures to strings uses a non-trivial homomorphism, and most of the
computations that are induced by the move operation are common to both the
syntactic linearisation and the semantic interpretation. This can be fixed by
transforming derivations with a homomorphism into intermediate structures.
Interestingly, at the level of these intermediate structures the feature
checking system of MGs is rendered by the resource sensitivity of linear logic.
Unfortunately, we would need extra logical constraints so as to rule out
certain terms that do not represent MG derivations and to avoid some spurious
ambiguities.

Finally, combining proof-nets and MSO-related techniques, we are able to give
a simple interpretation of derivation structures using MSO-transductions. The
main problem of this approach is that the fact that a proof-net is actually
transformed into a string needs to be proved, whereas this is guaranteed by
type-checking in the homomorphic approach that maps λ-terms to strings.
However, this approach is rather new in mathematical linguistics, so there may
be ways of guaranteeing certain properties easily. It would also be interesting
to see how this approach could be used for the semantic interpretation of
derivations.

This work appeals to various techniques from formal language theory: a
rule-based one, rendered by our representation of derivations as linear λ-terms
and the homomorphic interpretation of those terms, and a descriptive one,
resembling Model Theoretic Syntax [22], which uses MSO to describe and
interpret derivations. This variety of techniques allows us to appeal to many
results in the literature, retrieving results either from work on rule-based
techniques or from work on MSO-related techniques. These two points of view are
complementary. The first one helps to understand computational difficulty and
to design parsing algorithms. The second one greatly simplifies the overall
description of the formalism.
This duality in formalization helps us to understand in a better way the gap
that separates computational linguists from descriptive linguists. While the first 
are interested in accounting for the way sentences are constructed by designing 
some generation process, the second are mostly describing linguistic data, ex¬ 
plaining how constituents are related in certain circumstances. This opposition 
seems very similar to the one that opposes MTS to Context Free Grammars. 
Abstract. This paper presents learnability results from Typed Examples for
some classes of Lambek Grammars, in the context of Gold’s model of
identification in the limit. Typed Examples are semantic information and we
show that, as soon as syntax and semantics are connected by some compositional
morphism, they make it possible to learn rich syntactic formalisms. A learning
strategy is also presented and exemplified.


1 Introduction 

Although categorial grammars have been known for a long time, the question
of their learnability is a recent issue. In the nineties, Kanazawa opened
new avenues of research in this domain. He proved that, even if the class of
all AB-Categorial Grammars [2] (or CG) is trivially not learnable from positive
examples in Gold’s model [14], large subclasses are. The most interesting
classes are called k-valued: they are defined as the sets of CG assigning at
most k distinct categories to each member of their vocabulary. Thanks to his
work, we now know that for any $k \geq 1$, the class of k-valued CG is
learnable from Structural Examples (i.e. syntactic analysis structures where
rules are preserved but intermediate categories are deleted) and from strings
(or sentences). These theoretical results are associated with learning
algorithms, inspired by the pioneering work of Buszkowski and Penn [7].
Unfortunately, the only tractable (polynomial) case is the learning of rigid
(i.e. 1-valued) CG from Structural Examples. The problem that naturally arose
from these first results was to adapt them to known variants of k-valued Lambek
Grammars [18] (or LG). But this turned out not to be easy. As a matter of fact,
it has been proved that it is possible to learn the class of rigid LG from
proof structures of a certain normal form [6] but, on the contrary, this class
is not learnable from strings alone [12]. Thus, for LG, learnability results
crucially rely on the available input data. Another known learnability result
on rigid LG relies on additional restrictions on grammars.

We have introduced the concept of learnability from Typed Examples in the
context of CG. It has also been adapted to Pregroup Grammars [5] and other
formalisms. Typed Examples are sentences where each word is associated with a
lexicalized type derived from its category in the target grammar by a
morphism. Typed Examples can be considered as intermediary input data, richer
than strings but less informative than Structural Examples. They can also be
interpreted as coming from semantic information, as the types used are inspired
by Montague’s semantics [19]. From a cognitive perspective, it is relevant to
consider that semantics is acquired before syntax [23]. So, providing
semantically typed examples to help learning grammars is also cognitively
relevant. We have identified interesting subclasses of CG learnable from Typed
Examples. But, in this case, the learnability result itself was a trivial
consequence of the learnability of rigid CG from strings. For LG, the situation
is different as rigid LG are learnable from Structural Examples but not from
strings. In this article, we show that large subclasses of LG are in fact
learnable from Typed Examples. Furthermore, the learnability result for CG from
Typed Examples was associated with an original learning algorithm inspired by
syntactic analysis procedures. While Kanazawa’s learning algorithm implements a
generalisation strategy, ours is a specialisation strategy. This is also in
favor of a better cognitive relevance. We show here that this specialisation
strategy can also be applied to LG.

After some preliminary definitions, the paper focuses on the notion of
learnability from Typed Examples in Gold’s model. It is proved that interesting
subclasses of LG are learnable from Typed Examples and an original inference
algorithm (which is nevertheless not a full learning algorithm in the sense of
Gold) is provided. This global strategy is illustrated on examples. The
properties of our algorithm, some implementation details and extensions are
also discussed.

2 Preliminaries 

2.1 Categorial Grammars 

In categorial grammars, the syntax is lexicalized. The syntactic categories
assigned to each word carry its combinatorial potential. Every kind of
categorial grammar shares the same notion of categories.

Definition 1 (Categories). Let $\mathcal{B}$ be a countably infinite set of
basic categories containing a distinguished category $S \in \mathcal{B}$,
called the axiom. We note $Cat(\mathcal{B})$ the term algebra built over the
two binary symbols $/$ and $\backslash$, which is the smallest set such that
$\mathcal{B} \subseteq Cat(\mathcal{B})$ and for any
$A, B \in Cat(\mathcal{B})$ we have $/(A,B) \in Cat(\mathcal{B})$ and
$\backslash(A,B) \in Cat(\mathcal{B})$.

Definition 2 (AB-Categorial Grammars and Lambek Grammars). For every finite
vocabulary $\Sigma$ and for every set of basic categories $\mathcal{B}$
($S \in \mathcal{B}$), a categorial grammar is a finite relation $G$ over
$\Sigma \times Cat(\mathcal{B})$. We note $\langle u, A \rangle \in G$ the
assignment of the category $A \in Cat(\mathcal{B})$ to the element of the
vocabulary $u \in \Sigma$. The family of categorial grammars is composed of two
main subclasses: AB-Categorial Grammars (CG) and Lambek Grammars (LG).

1 For reasons that will become clear later, we give up here the classical
notations $B/A$ and $A\backslash B$: in our notation, the terms $/(A,B)$ and
$\backslash(A,B)$ are both functors whose first component $A$ is the argument
and whose second component $B$ is the result.


I. Tellier and D. Dudau-Sofronie 


AB-Categorial Grammars (CG) are categorial grammars where the syntactic
rules take the form of two rewriting schemes: $\forall A, B \in Cat(\mathcal{B})$

- FA (Forward Application): $/(A,B)\ A \rightarrow B$

- BA (Backward Application): $A\ \backslash(A,B) \rightarrow B$

The language generated by a CG $G$ is:

$$L(G) = \{u_1 \ldots u_n \in \Sigma^+ \mid \forall i \in \{1, \ldots, n\},\ \exists A_i \in Cat(\mathcal{B}) \text{ such that } \langle u_i, A_i \rangle \in G \text{ and } A_1 \ldots A_n \rightarrow^* S\}.$$

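Lambek Grammars (LG) are categorial grammars in which the syntactic analysis
is described by a logical calculus defined by:

- axioms:

$$[ID]\quad A \vdash A$$

- inference rules:

$$[/R]\ \dfrac{\Gamma, A \vdash B}{\Gamma \vdash /(A,B)} \qquad [\backslash R]\ \dfrac{A, \Gamma \vdash B}{\Gamma \vdash \backslash(A,B)}$$

$$[/L]\ \dfrac{\Gamma \vdash A \qquad \Delta, B, \Pi \vdash C}{\Delta, /(A,B), \Gamma, \Pi \vdash C} \qquad [\backslash L]\ \dfrac{\Gamma \vdash A \qquad \Delta, B, \Pi \vdash C}{\Delta, \Gamma, \backslash(A,B), \Pi \vdash C}$$

where $A, B, C \in Cat(\mathcal{B})$ and $\Gamma, \Delta, \Pi$ are finite
sequences of categories from $Cat(\mathcal{B})$, $\Gamma \neq \emptyset$. The
language $L(G)$ generated by a LG $G$ is:

$$L(G) = \{u_1 \ldots u_n \in \Sigma^+ \mid \forall i \in \{1, \ldots, n\},\ \exists A_i \in Cat(\mathcal{B}) \text{ such that } \langle u_i, A_i \rangle \in G \text{ and } A_1, \ldots, A_n \vdash S\}.$$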

In the following, elements of $\Sigma$ will be called words and elements of
$L(G)$ will be called sentences.

Of course, the rewriting schemes allowed in CG are valid sequents of LG (see
the rules $[/L]$ and $[\backslash L]$). So, for a given assignment of
categories, every sentence belonging to the language of a CG also belongs to
the language of the corresponding LG. The variant of the Lambek calculus
considered in this paper is non-commutative, associative, without product and
without empty antecedent. This variant is the one which has received the
greatest linguistic and logical attention. The Lambek calculus can also be
expressed in the context of natural deduction, but here we only use the sequent
calculus which can be easily linked with parse algorithms.
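
To make the link with parse algorithms concrete, here is a minimal sketch of
cut-free backward proof search for the calculus above. The encoding is our own
assumption: a basic category is a Python string, $/(A,B)$ is the tuple
("/", A, B) and $\backslash(A,B)$ is ("\\", A, B). The search terminates
because every premise contains fewer operators than its conclusion; it is
naive and makes no claim to efficiency.

```python
from functools import lru_cache


def fwd(a, b):
    """/(A, B): functor expecting its argument A on the right."""
    return ("/", a, b)


def bwd(a, b):
    """\\(A, B): functor expecting its argument A on the left."""
    return ("\\", a, b)


@lru_cache(maxsize=None)
def derivable(gamma, c):
    """Decide gamma |- c, where gamma is a non-empty tuple of categories."""
    if gamma == (c,):                                   # [ID]
        return True
    if isinstance(c, tuple):
        op, a, b = c
        if op == "/" and derivable(gamma + (a,), b):    # [/R]
            return True
        if op == "\\" and derivable((a,) + gamma, b):   # [\R]
            return True
    for i, cat in enumerate(gamma):
        if not isinstance(cat, tuple):
            continue
        op, a, b = cat
        if op == "/":       # [/L]: the argument sequence lies to the right
            for j in range(i + 2, len(gamma) + 1):
                if (derivable(gamma[i + 1:j], a)
                        and derivable(gamma[:i] + (b,) + gamma[j:], c)):
                    return True
        else:               # [\L]: the argument sequence lies to the left
            for j in range(i):
                if (derivable(gamma[j:i], a)
                        and derivable(gamma[:j] + (b,) + gamma[i + 1:], c)):
                    return True
    return False


# The two rewriting schemes of CG are derivable sequents.
print(derivable((fwd("A", "B"), "A"), "B"))   # True (FA)
print(derivable(("A", bwd("A", "B")), "B"))   # True (BA)
```

The same function can be applied to the sequent associated with a sentence,
once a category has been chosen for each word.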

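Example 1. Let $\mathcal{B} = \{S, T, N\}$ be a set of basic categories ($T$
stands for “term” and $N$ for “common noun”) and
$\Sigma = \{a, lecture, teaches, Alain\}$ a vocabulary. Let $G$ be the Lambek
Grammar over $\Sigma \times Cat(\mathcal{B})$ defined by the following
assignments:
$\{\langle a, /(N, \backslash(/(T,S),S)) \rangle, \langle lecture, N \rangle, \langle Alain, T \rangle, \langle teaches, \backslash(T,S) \rangle, \langle teaches, /(T, \backslash(T,S)) \rangle\}$.
$G$ recognizes sentences like “Alain teaches” or “Alain teaches a lecture”, as
displayed by the following proof (the notation [ID] is omitted for
readability):

$$
\dfrac{
  \dfrac{
    \dfrac{
      \dfrac{T \vdash T \qquad \dfrac{T \vdash T \qquad S \vdash S}{T, \backslash(T,S) \vdash S}\,[\backslash L]}
            {T, /(T,\backslash(T,S)), T \vdash S}\,[/L]
    }{T, /(T,\backslash(T,S)) \vdash /(T,S)}\,[/R]
    \qquad S \vdash S
  }{T, /(T,\backslash(T,S)), \backslash(/(T,S),S) \vdash S}\,[\backslash L]
  \qquad N \vdash N
}{T, /(T,\backslash(T,S)), /(N,\backslash(/(T,S),S)), N \vdash S}\,[/L]
$$

The categories of the final sequent correspond, from left to right, to the
words: Alain, teaches, a, lecture.
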

It is easy to observe that this sentence is not recognized by the CG with the
same assignment of categories. To do so, it would be necessary to assign an
extra category to the word “teaches”: $\backslash(T, /(T, S))$. The inference
rules of the Lambek calculus simulate multiple category assignments, but at the
price of a higher complexity for parsing [23].
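
This observation can be checked mechanically. The sketch below is a small
chart-based recognizer restricted to the two schemes FA and BA; the tuple
encoding of categories and the names LEXICON and recognizes are our own
assumptions, and the lexicon is the one of Example 1.

```python
from itertools import product


def fwd(a, b):
    """/(A, B): functor expecting its argument A on the right."""
    return ("/", a, b)


def bwd(a, b):
    """\\(A, B): functor expecting its argument A on the left."""
    return ("\\", a, b)


def combine(left, right):
    """Categories obtained from two adjacent categories by FA or BA."""
    out = set()
    if isinstance(left, tuple) and left[0] == "/" and left[1] == right:
        out.add(left[2])     # FA: /(A,B) A -> B
    if isinstance(right, tuple) and right[0] == "\\" and right[1] == left:
        out.add(right[2])    # BA: A \(A,B) -> B
    return out


def recognizes(lexicon, words, axiom="S"):
    """CYK-style recognition: chart[(i, j)] holds the categories of words[i:j]."""
    n = len(words)
    if n == 0:
        return False
    chart = {(i, i + 1): set(lexicon.get(w, ())) for i, w in enumerate(words)}
    for width in range(2, n + 1):
        for i in range(n - width + 1):
            j = i + width
            cell = set()
            for k in range(i + 1, j):
                for l, r in product(chart[(i, k)], chart[(k, j)]):
                    cell |= combine(l, r)
            chart[(i, j)] = cell
    return axiom in chart[(0, n)]


LEXICON = {
    "a": {fwd("N", bwd(fwd("T", "S"), "S"))},
    "lecture": {"N"},
    "Alain": {"T"},
    "teaches": {bwd("T", "S"), fwd("T", bwd("T", "S"))},
}
sentence = "Alain teaches a lecture".split()
print(recognizes(LEXICON, sentence))     # False with FA and BA only

extended = dict(LEXICON)
extended["teaches"] = LEXICON["teaches"] | {bwd("T", fwd("T", "S"))}
print(recognizes(extended, sentence))    # True once the extra category is added
```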

We denote by $\mathcal{G}$ the class of CG and by $\mathcal{LG}$ the class of
LG. For every integer $k \geq 1$, the set of CG (resp. LG) assigning at most
$k$ distinct categories to each word is the class of k-valued CG (resp. LG),
denoted by $\mathcal{G}_k$ (resp. $\mathcal{LG}_k$).


2.2 Semantic Types 

Montague [19] was one of the first to propose a typed logic to represent
natural language semantics. The associated notion of semantic type has since
become classical. It is this (slightly generalised) notion of type that will be
used here. These types can also be assigned to words, according to their
lexical semantics. Like categories, types characterize a combinatorial
potential, but at the semantic level.

Definition 3 (Semantic Types). Let $\Theta$ be a finite set of basic types
containing a distinguished type $t \in \Theta$. We note $Types(\Theta)$ the set
of all possible types, which is the smallest set such that
$\Theta \subseteq Types(\Theta)$ and for any $u, v \in Types(\Theta)$,
$(u, v) \in Types(\Theta)$. The type $(u, v)$ is assigned to a functor
expecting an argument of type $u$ and providing a result of type $v$.

Example 2. The usual set of basic types is $\Theta = \{e, t\}$, where $e$ is
the type of elementary entities and $t$ the type of truth values. In a
logic-based semantic representation, identifiers for individuals like “Alain”
can be represented by a logical constant of type $e$ (Montague preferred a more
complicated type) while common nouns and intransitive verbs can be associated
with one-place predicates, of type $(e, t)$. Transitive verbs are denoted by
two-place predicates, of type $(e, (e, t))$. Verbs like “teaches” can have both
a transitive and an intransitive use, and thus receive both assignments of
types.

There is a close connection between categories of categorial grammars and
semantic types. Both are binary terms. This connection can be formalized by the
notion of Typing Function.

2 In Montague’s tradition, this type would be noted $\langle u, v \rangle$, or
$u \rightarrow v$, but we prefer a notation closer to the one of categories.
Definition 4 (Typing Function). For any set of basic categories $\mathcal{B}$
($S \in \mathcal{B}$) and any set of basic types $\Theta$ ($t \in \Theta$), a
Typing Function $h$ is a morphism from $Cat(\mathcal{B})$ to $Types(\Theta)$
satisfying the following conditions:

1. $h(S) = t$;

2. $\forall A, B \in \mathcal{B}$: if $h(A) = h(B) \in \Theta$ then $A = B$.
Note that this does not imply that $h$ is injective on $\mathcal{B}$, because
of the condition that both images must belong to $\Theta$, i.e. be basic types.

3. $\forall A, B \in Cat(\mathcal{B})$:
$h(/(A,B)) = h(\backslash(A,B)) = (h(A), h(B))$.

As $h$ is a morphism and $Cat(\mathcal{B})$ is built over the set
$\mathcal{B}$, it is enough to define $h$ on $\mathcal{B}$ to deduce its values
on $Cat(\mathcal{B})$. This definition justifies the notation chosen for
categories, where the operators ($/$ or $\backslash$) playing the role of
functors in a term are simply deleted by the Typing Function.

Example 3. If we set $h(T) = e$, $h(S) = t$, $h(N) = (e, t)$, we define a
Typing Function for the categories of the grammar in Example 1 perfectly
compatible with their semantics in Example 2. Note that a basic category ($N$
in our example) can be associated with a non-basic type and that two distinct
(non-basic, otherwise it would contradict condition (2)) categories can be
associated with an identical type, as
$h(\backslash(T,S)) = (h(T), h(S)) = (e, t) = h(N)$. As a matter of fact, both
common nouns and intransitive verbs semantically behave as one-place
predicates, but they are not syntactically equivalent: so, they share the same
semantic types but not the same syntactic categories.

The Principle of Compositionality asserts that the meaning of a sentence
depends only on the meaning of its parts and on its syntactic structure
(where the “parts” are usually identified with words). This principle is at the
heart of the correspondence between syntax and semantics in Montague’s work,
and in the categorial grammar framework in general. In this framework, it is
usually translated by a similarity of structure between syntactic and semantic
trees (or proofs). But categories and types are lexicalized structures linked by a
Typing Function. The Typing Function can thus be seen as the lexicalized version
of the Principle of Compositionality. If semantics is acquired before syntax, it
means that semantic types may be available to a learner who has to acquire the
syntactic categories of his mother tongue. So, the inputs available to this learner
are sentences labelled by semantic types: this is what we call h-Typed Examples.

Definition 5 (The h-Typed Language of a Lambek Grammar). For any sets
$\Sigma$, $\mathcal{B}$ and $\Theta$, any LG $G$ over
$\Sigma \times Cat(\mathcal{B})$ and any Typing Function $h$ from
$Cat(\mathcal{B})$ to $Types(\Theta)$, the h-Typed Language of $G$ is defined
by: $\langle u_1, \tau_1 \rangle \ldots \langle u_n, \tau_n \rangle \in TL_h(G)$
if $\forall i \in \{1, \ldots, n\}\ \exists A_i$ such that
$\langle u_i, A_i \rangle \in G$, $\tau_i = h(A_i)$ and
$A_1, \ldots, A_n \vdash S$. An element of $TL_h(G)$ is called an h-Typed
Example of $G$.

Example 4. The h-Typed Example corresponding with the sentence analysed in
Example 1 and the Typing Function of Example 3 is the following:

$$\langle Alain, e \rangle \langle teaches, (e, (e, t)) \rangle \langle a, ((e, t), ((e, t), t)) \rangle \langle lecture, (e, t) \rangle.$$
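
The Typing Function and this h-Typed Example can be computed directly. In the
sketch below, the tuple encoding of categories and types and the helper
typing_function are illustrative assumptions; the values of $h$ on basic
categories are those of Example 3.

```python
def fwd(a, b):
    """/(A, B): functor expecting its argument A on the right."""
    return ("/", a, b)


def bwd(a, b):
    """\\(A, B): functor expecting its argument A on the left."""
    return ("\\", a, b)


def typing_function(base):
    """Extend a map defined on basic categories to a morphism on all categories."""
    def h(cat):
        if isinstance(cat, str):
            return base[cat]
        _op, a, b = cat            # the operator is simply deleted
        return (h(a), h(b))
    return h


h = typing_function({"T": "e", "S": "t", "N": ("e", "t")})

# Categories used in the analysis of "Alain teaches a lecture" (Example 1).
analysis = [
    ("Alain", "T"),
    ("teaches", fwd("T", bwd("T", "S"))),
    ("a", fwd("N", bwd(fwd("T", "S"), "S"))),
    ("lecture", "N"),
]
typed_example = [(word, h(cat)) for word, cat in analysis]
for pair in typed_example:
    print(pair)
```

Two syntactically distinct categories may receive the same type: with this
encoding, h(bwd("T", "S")) and h("N") are both ("e", "t").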
Definition 6 (Lengths). For any category $A \in Cat(\mathcal{B})$ (resp. for
any type $\tau \in Types(\Theta)$), the length of $A$, noted $|A|$ (resp. the
length of $\tau$, noted $|\tau|$) is the number of basic categories (resp. of
basic types) it contains:

- if $A \in \mathcal{B}$ (resp. $\tau \in \Theta$) then $|A| = 1$ (resp.
$|\tau| = 1$);

- $\forall A, B \in Cat(\mathcal{B})$ (resp. $\forall u, v \in Types(\Theta)$):
$|/(A,B)| = |\backslash(A,B)| = |A| + |B|$ (resp. $|(u, v)| = |u| + |v|$).

Lemma 1. (trivial) For any set of basic categories $\mathcal{B}$, any set of
basic types $\Theta$ and any Typing Function $h$ from $Cat(\mathcal{B})$ to
$Types(\Theta)$ we have:

$$\forall C \in Cat(\mathcal{B}),\ |C| \leq |h(C)|.$$
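
As a quick sanity check of Lemma 1 on the categories of Example 1, the
following sketch computes both lengths with a single function. The tuple
encoding and the Typing Function of Example 3 are restated here so that the
block is self-contained; all names are illustrative.

```python
def length(term):
    """Number of basic symbols in a category (op, A, B) or in a type (u, v)."""
    if isinstance(term, str):
        return 1
    return length(term[-2]) + length(term[-1])


def h(cat):
    """Typing Function of Example 3, extended as a morphism."""
    base = {"T": "e", "S": "t", "N": ("e", "t")}
    if isinstance(cat, str):
        return base[cat]
    _op, a, b = cat
    return (h(a), h(b))


CATEGORIES = [
    "T",
    "N",
    ("\\", "T", "S"),
    ("/", "T", ("\\", "T", "S")),
    ("/", "N", ("\\", ("/", "T", "S"), "S")),
]
for c in CATEGORIES:
    print(length(c), length(h(c)))   # the first number never exceeds the second
```

The inequality can be strict, as for $N$, because a basic category may be
mapped to a non-basic type.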


2.3 Grammar Systems and Learnability Theory 

To deal with questions of learnability, Kanazawa introduced the notion of
Grammar System, allowing a reformulation of Gold’s classical model of
identification in the limit from positive examples [14]. We recall this notion
here, together with some known learnability results concerning categorial
grammars.

Definition 7 (Grammar System). A Grammar System is a triple
$(\Omega, \mathcal{A}, L)$:

- $\Omega$ is the hypothesis space (here, $\Omega$ will be a set of grammars);

- the sample space $\mathcal{A}$ is a recursive subset of $A^*$, for some
fixed alphabet $A$ (elements of $\mathcal{A}$ are sentences and subsets of
$\mathcal{A}$ are languages);

- $L : \Omega \rightarrow pow(\mathcal{A})$ is a naming function. The question
of whether $w \in L(G)$, which holds between $w \in \mathcal{A}$ and
$G \in \Omega$, is supposed to be computable.

The main Grammar System we deal with in the remainder of this paper is
$(\mathcal{LG}, (\Sigma \times Types(\Theta))^*, TL_h)$. It means that we are
going to study how to learn a Lambek Grammar from Typed Examples, that is from
sentences where each word is associated with its semantic type.

Definition 8 (Learnability Criterion). Let $(\Omega, \mathcal{A}, L)$ be a
Grammar System and $\phi : \bigcup_{k \geq 1} \mathcal{A}^k \rightarrow \Omega$
be a computable function. We say that $\phi$ converges to $G \in \Omega$ on a
sequence $(s_i)_{i \in \mathbb{N}}$ of elements of $\mathcal{A}$ if
$G_i = \phi(\langle s_0, \ldots, s_i \rangle)$ is defined and equal to $G$ for
all but finitely many $i \in \mathbb{N}$, or equivalently if there exists
$n_0 \in \mathbb{N}$ such that for all $i \geq n_0$, $G_i$ is defined and equal
to $G$. Such a function $\phi$ is said to learn $\mathcal{G} \subseteq \Omega$
if for every language $L$ in $L(\mathcal{G}) = \{L(G) \mid G \in \mathcal{G}\}$
and for every infinite sequence $(s_i)_{i \in \mathbb{N}}$ that enumerates the
elements of $L$ (i.e. such that $\{s_i \mid i \in \mathbb{N}\} = L$), there
exists some $G$ in $\mathcal{G}$ such that $L(G) = L$ and $\phi$ converges to
$G$ on $(s_i)_{i \in \mathbb{N}}$.

Kanazawa proved that for every $k \geq 1$, the class $\mathcal{G}_k$ of CG
assigning at most $k$ distinct categories to each of its words is learnable in
the Grammar System $(\mathcal{G}, \Sigma^*, L)$. So, any CG which is known to
be k-valued for a given $k$ is learnable from plain sentences. As a
consequence, subclasses of CG are also learnable with more information, such as
Typed Examples. These classes can thus trivially be learned in the Grammar
System $(\mathcal{G}, (\Sigma \times Types(\Theta))^*, TL_h)$ [11].
For LG, we also know that the class of rigid LG is learnable from Structural
Examples where the structure is provided by a normal form of proofs [6], but
is not learnable in the Grammar System $(\mathcal{LG}, \Sigma^*, L)$ [12], i.e.
from strings alone. We will now focus on the learnability of subclasses of LG
in the Grammar System $(\mathcal{LG}, (\Sigma \times Types(\Theta))^*, TL_h)$,
i.e. from Typed Examples, which are h-Typed Examples where $h$ is not known.

3 Learning Lambek Grammars from Typed Examples 

In this section, we first prove learnability results for subclasses of LG from Typed 
Examples. We then describe an original algorithm which provides the set of LG 
compatible with a set of Typed Examples. 

3.1 Learnability Theorems 

The classes of LG we are interested in are those for which types are enough to 
characterize categories. The condition we introduce thus means that each time 
a single word is associated with two distinct categories, then the corresponding 
semantic types are also distinct. 

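Definition 9 (Typed Lambek Grammars). For every vocabulary $\Sigma$, every
set of basic categories $\mathcal{B}$, every set of basic types $\Theta$, the
class of Typed LG $\mathcal{LG}_{type}$ is the set of LG $G$ over
$\Sigma \times Cat(\mathcal{B})$ such that there exists a Typing Function $h$
from $Cat(\mathcal{B})$ to $Types(\Theta)$ satisfying the condition:

$$\forall u \in \Sigma,\ \forall \langle u, A \rangle \in G \text{ and } \langle u, A' \rangle \in G,\quad A \neq A' \Rightarrow h(A) \neq h(A').$$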

The class $\mathcal{LG}_{type}$ is very similar to the one studied in the
context of CG, called $\mathcal{G}_{type}$. We have proved interesting language
theoretical results concerning $\mathcal{G}_{type}$ for the special case where
$h = h_0$, defining a one-to-one correspondence between $\mathcal{B}$ and
$\Theta$ (i.e. $h_0$ only deletes operators without modifying anything else).
As a matter of fact, for every CG, there exists a member of this class with
$h = h_0$ recognizing the same structure language. We do not have any similar
result for LG and the class $\mathcal{LG}_{type}$, but note that
$\mathcal{LG}_{type}$ contains every rigid LG and intersects every set of
k-valued LG. For example, the LG of Example 1 is 2-valued and in
$\mathcal{LG}_{type}$ because, for any $h$, the two distinct categories
assigned to the word “teaches” are associated with distinct types. Nevertheless
the restriction expressed in Definition 9 prohibits assigning both
$/(N, \backslash(/(T,S),S))$ and $/(N, /(\backslash(T,S),S))$ to determiners
(classically, the first one is assigned to determiners introducing direct
objects, the second one to determiners introducing subjects) because both
always lead to the same type (the type $((e, t), ((e, t), t))$ in our example).
We will see in the following how to treat such cases.

Theorem 1. The class $\mathcal{LG}_{type}$ is learnable from Typed Examples,
i.e. in the Grammar System
$(\mathcal{LG}, (\Sigma \times Types(\Theta))^*, TL_h)$.

The main idea of the proof is that there exists a finite number of possible 
categories which are compatible with a given semantic type, and so a finite 
number of possible grammars (and of possible h) compatible with any given 
sample of Typed Examples. This property is detailed in the following lemma. 
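
The finiteness claim can be illustrated by enumerating the possible shapes of
categories whose image is a given type. This sketch is our own illustration,
not the authors' algorithm: basic categories are abstracted into the
placeholder "_", so it counts shapes and ignores which basic category fills
each position. A basic type can only come from a basic category, whereas a
type $(u, v)$ can come from a basic category or from $/(A,B)$ or
$\backslash(A,B)$ with $A$ compatible with $u$ and $B$ compatible with $v$.

```python
def category_shapes(ty):
    """All category shapes whose image under some Typing Function can be `ty`."""
    shapes = ["_"]                      # a basic category mapped to `ty`
    if isinstance(ty, tuple):
        u, v = ty
        for a in category_shapes(u):
            for b in category_shapes(v):
                shapes.append(("/", a, b))
                shapes.append(("\\", a, b))
    return shapes


print(len(category_shapes(("e", "t"))))                             # 3
print(len(category_shapes(("e", ("e", "t")))))                      # 7
print(len(category_shapes((("e", "t"), (("e", "t"), "t")))))        # 43
```

The number of shapes grows quickly with the length of the type, but it is
always finite.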
Lemma 2. For every vocabulary $\Sigma$, all sets $\mathcal{B}$ and $\Theta$,
and every $G \in \mathcal{LG}_{type}$ without useless category (i.e. without
any category assignment never used in any syntactic analysis) associated with
the Typing Function $h$ from $Cat(\mathcal{B})$ to $Types(\Theta)$, there
exists an integer $N \in \mathbb{N}$ and a finite sample
$(s_i)_{i<N} \subseteq TL_h(G)$, such that from this sample it is possible to
compute:

- the least integer $k$ such that $G$ is k-valued;

- a bound on the maximal length of the categories assigned to the words of $G$;

- a bound on the maximal number of distinct basic categories used to define
the categories assigned to the words of $G$.

Proof (of Lemma 2). For any given $G \in \mathcal{LG}_{type}$, the required characteristic
sample just needs to contain some Typed Examples such that at least
one occurrence of every couple (word, type) appears somewhere. By definition
of $G \in \mathcal{LG}_{type}$, it is enough to take the Typed Examples corresponding with
sentences such that at least one occurrence of every couple (word, category) is
required for parsing. From such a sample set, let us compute each of the values:

— To compute the least integer $k$ such that $G$ is $k$-valued, it is enough to
note that the condition for $G$ to belong to $\mathcal{LG}_{type}$ is precisely that there are
exactly the same number of distinct couples (word, category) in $G$ as there
are corresponding distinct couples (word, type = $h$(category)) in elements
of $TL_h(G)$. After all such couples have been presented at least once in the
sample set, $k$ is available.

— To compute a bound on the maximal length of categories assigned to the
words of $G$ it is, similarly, enough to take the maximal length of types appearing
in elements of the sample set. Lemma 1 ensures that the bound on
the lengths of types is also a bound on the lengths of categories. Let us call
$L$ such a bound.

— Finally, to compute a bound on the maximal number of distinct basic categories
used to define the categories assigned to the words of $G$, it is enough
to take advantage of both previous results. This number is bounded by
$k \times L \times |\Sigma|$ where $|\Sigma|$ stands for the number of distinct words in $G$, available
as soon as each word has been presented at least once.
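
The three quantities of the proof can be read off a sample mechanically. The following Python sketch is only an illustration: the encoding of types as nested pairs, the function names, and in particular the definition of the length of a type as its number of basic-type occurrences are assumptions made here, not definitions taken from the paper.

```python
def type_length(ty):
    # Assumption: the length of a type is its number of basic-type occurrences.
    if isinstance(ty, str):          # basic type such as "e" or "t"
        return 1
    arg, res = ty                    # complex type (arg, res)
    return type_length(arg) + type_length(res)


def sample_bounds(sample):
    """Return (k, L, k * L * |Sigma|) for a sample of Typed Examples.

    A Typed Example is a list of (word, type) couples.
    """
    types_of = {}
    for example in sample:
        for word, ty in example:
            types_of.setdefault(word, set()).add(ty)
    k = max(len(types) for types in types_of.values())
    L = max(type_length(ty) for types in types_of.values() for ty in types)
    return k, L, k * L * len(types_of)


ET = ("e", "t")
SAMPLE = [[("Alain", "e"), ("teaches", ("e", ET)),
           ("a", (ET, (ET, "t"))), ("lecture", ET)]]
```

On this one-sentence sample every word has a single type, the longest type is the one of the determiner, and the vocabulary has four words.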

Proof (of Theorem 1). The theorem is a direct consequence of Lemma 2. As
a matter of fact, the lemma implies that there exists a finite computable number
of LG without useless categories (up to a renaming of basic categories) compatible
with any sequence enumerating the elements of some $TL_h(G)$ and that this
set is recursively enumerable. This situation classically implies the learnability
of the class $\mathcal{LG}_{type}$ in the Grammar System $\langle \mathcal{LG}, (\Sigma \times Types(\Theta))^*, TL_h \rangle$.

It is now easy to generalize this result to the case of multiple category assign¬ 
ments to the same word, corresponding with a unique type. 

Definition 10 (m-distinct Typed LG). For every vocabulary $\Sigma$, every set
of basic categories $B$, every set of basic types $\Theta$ and every $m \geq 1$, the class of
m-distinct Typed LG, noted $\mathcal{LG}^m_{type}$, is the set of LG $G$ over $\Sigma \times Cat(B)$ such
that there exists a Typing Function $h$ from $Cat(B)$ to $Types(\Theta)$ satisfying:

$$\max_{\langle u, A \rangle \in G} \mathrm{Card}\{A' \mid \langle u, A' \rangle \in G \text{ and } h(A) = h(A')\} \leq m$$

When $m = 1$, this condition is equivalent with Definition 9, so we keep the
notation: $\mathcal{LG}_{type} = \mathcal{LG}^1_{type}$. If, for example, $G \in \mathcal{LG}$ contains two assignments for
determiners associated with the same type (e.g. with $\langle a, /(N, \backslash(/(T, S), S)) \rangle \in G$
for introducing direct objects and $\langle a, /(N, /(\backslash(T, S), S)) \rangle \in G$ for introducing
subjects) then $G \in \mathcal{LG}^2_{type}$ but $G \notin \mathcal{LG}^1_{type}$.

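Theorem 2. (obvious)

$$\forall m \geq 1,\ \mathcal{LG}^m_{type} \subset \mathcal{LG}^{m+1}_{type} \quad \text{and} \quad \mathcal{LG} = \bigcup_{m \geq 1} \mathcal{LG}^m_{type}$$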

The hierarchy of m-distinct Typed Lambek Grammars plays the same role
here as the one played by the hierarchy of k-valued CG in [17]. Similarly, the
results obtained for the class $\mathcal{LG}_{type}$ can easily be generalized to the classes
$\mathcal{LG}^m_{type}$ for any $m \geq 1$.

Theorem 3. For any $m \geq 1$, the class $\mathcal{LG}^m_{type}$ is learnable from Typed Examples,
i.e. in the Grammar System $\langle \mathcal{LG}, (\Sigma \times Types(\Theta))^*, TL_h \rangle$.

Proof (of Theorem 3). The known number $m$ is a new multiplicative factor to be
applied on the previous computation of the number of distinct LG compatible
with a set of Typed Examples, but this set of grammars is still finite.

The proofs of these theorems suggest enumerative learning algorithms. This 
is not, of course, a tractable strategy. We define in the following another way to 
identify the set of LG compatible with a given set of Typed Examples. 

3.2 Learning Strategy 

We first focus on the class $\mathcal{LG}_{type}$ (i.e. $m = 1$) and propose a strategy to infer
LG in this class from Typed Examples. The key point of this strategy is to
observe that types provide indications about the functor or argument nature of
the words they are associated with, even if the direction of the operators (/ or \)
is lost. Types are “degraded versions” of the categories they derive from by $h$
and the goal of the learning strategy is to “rebuild” the categories from the types.
This will be achieved in three steps described below: variabilisation, constraint
inference and category deduction.


Variabilisation. First, a step of variabilisation is necessary to introduce variables
in type expressions at operator positions, i.e. before every opening parenthesis
of every type. This step is applied once for all Typed Examples in the
input, respecting the following constraint: every occurrence of the same word
with the same associated type receives the same variables. This constraint is a
direct consequence of the target class of grammars: for every grammar in $\mathcal{LG}_{type}$,
each time the same couple (word, type) occurs in a Typed Example, we know
that it refers to a unique couple (word, category).

Definition 11 (types with variables). Let $X$ be an infinite countable set of
variables. The set of types with variables over the set of basic types $\Theta$ is denoted
$VarType(\Theta)$ and is defined as the smallest set such that: $\Theta \subseteq VarType(\Theta)$ and
for any $u, v \in VarType(\Theta)$ and $x_i \in X \cup \{/, \backslash\}$, $x_i(u, v) \in VarType(\Theta)$.

The variables in $X$ are mapped to $X \cup \{/, \backslash\}$. Note that this variabilisation
step is defined relatively to a set of Typed Examples.

Example 5. The variabilisation step applied to the Typed Example of Example U] 
gives: 

Alain   teaches           a                              lecture
e       x0(e,x1(e,t))     x2(x3(e,t),x4(x5(e,t),t))      x6(e,t)
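
A minimal Python sketch of the variabilisation step is given here as an illustration. The encoding is an assumption of this sketch: a basic type is a string, a complex type is a pair (argument, result), and a type with variables is a triple (variable, argument, result). Variables are numbered in the order in which opening parentheses are read, and a cache indexed by the couple (word, type) enforces the sharing constraint stated above.

```python
from itertools import count


def variabilise(examples):
    """Introduce a fresh variable before every opening parenthesis.

    The same (word, type) couple always receives the same variables.
    """
    fresh = count()          # generator of variable indices 0, 1, 2, ...
    cache = {}               # (word, type) -> type with variables

    def walk(ty):
        if isinstance(ty, str):              # basic type: nothing to introduce
            return ty
        arg, res = ty
        var = f"x{next(fresh)}"              # numbered before visiting subtypes
        return (var, walk(arg), walk(res))

    def typed(word, ty):
        key = (word, ty)
        if key not in cache:
            cache[key] = walk(ty)
        return cache[key]

    return [[(word, typed(word, ty)) for word, ty in example]
            for example in examples]


ET = ("e", "t")
DET = (ET, (ET, "t"))
EXAMPLES = [
    [("Alain", "e"), ("teaches", ("e", ET)), ("a", DET), ("lecture", ET)],
    [("Isabelle", "e"), ("writes", ("e", ET)), ("a", DET), ("paper", ET)],
]
RESULT = variabilise(EXAMPLES)
```

With this numbering the first sentence receives the variables x0 to x6 as in the example, and in the second sentence the determiner reuses x2 to x5 while the new words receive x7, x8 and x9.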

In fact, as already explained for CG, such a variabilisation implicitly specifies
a set of grammars: the set of every grammar in $\mathcal{LG}_{type}$ sharing the same semantic
type assignments. As already seen, this set of grammars is always finite. Among
them, we now want to select the grammar(s) for which there exists $h$ such that
their $h$-Typed language contains a given set of Typed Examples. To characterise
these grammars, we will need to constrain the possible values of the introduced
variables.


Constraint Inference. This step consists in deducing constraints over the 
variables introduced. These constraints will be stored into substitutions. 


Definition 12. A substitution is a mapping from $X$ to $X \cup \{/, \backslash\}$. For any
substitution $\sigma$, $\sigma$ is extended over $VarType(\Theta)$ as follows:

1. $\forall u \in Types(\Theta) \cup \{/, \backslash\}$, $\sigma(u) = u$;

2. $\sigma(x_i(u, v)) = \sigma(x_i)(\sigma(u), \sigma(v))$.
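
The extension of a substitution to types with variables can be sketched in a few lines of Python. The encoding is an assumption of this sketch: a substitution is a dictionary from variable names to "/", "\" or another variable name, variables absent from the dictionary are left unchanged, and a type with variables is a triple (operator or variable, argument, result).

```python
def apply_substitution(sigma, ty):
    """Apply sigma recursively, following the two clauses of the definition."""
    if isinstance(ty, str):                  # clause 1: basic types are unchanged
        return ty
    op, arg, res = ty                        # clause 2: rewrite the operator position
    return (sigma.get(op, op),
            apply_substitution(sigma, arg),
            apply_substitution(sigma, res))


# Hypothetical encoding of a substitution that sets x2 to "\", x4 to "/",
# and identifies x3 with x1 and x6 with x5.
SIGMA = {"x2": "\\", "x3": "x1", "x4": "/", "x6": "x5"}
DET = ("x2", ("x3", "e", "t"), ("x4", ("x5", "e", "t"), "t"))
```

Applied to the variabilised type of the determiner, this substitution fixes the two outer operators and renames the inner variables.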


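For any substitution $\sigma$, we define an adapted $\sigma$-dependent Lambek-inspired
sequent calculus:

— axioms: $[ID]_\sigma$: $A \Vdash_\sigma A'$ whenever $\sigma(A) = \sigma(A')$, for any $A, A' \in VarType(\Theta)$;

— inference rules:

$$[/R]_\sigma\ \frac{\Gamma, A \Vdash_\sigma B \qquad \sigma(x) = /}{\Gamma \Vdash_\sigma x(A,B)} \qquad\qquad [\backslash R]_\sigma\ \frac{A, \Gamma \Vdash_\sigma B \qquad \sigma(x) = \backslash}{\Gamma \Vdash_\sigma x(A,B)}$$

$$[/L]_\sigma\ \frac{\Gamma \Vdash_\sigma A \qquad \Delta, B, \Pi \Vdash_\sigma C \qquad \sigma(x) = /}{\Delta, x(A,B), \Gamma, \Pi \Vdash_\sigma C} \qquad\qquad [\backslash L]_\sigma\ \frac{\Gamma \Vdash_\sigma A \qquad \Delta, B, \Pi \Vdash_\sigma C \qquad \sigma(x) = \backslash}{\Delta, \Gamma, x(A,B), \Pi \Vdash_\sigma C}$$

where $A, B, C \in VarType(\Theta)$ and $\Gamma$, $\Delta$, $\Pi$ are sequences of types with variables
in $VarType(\Theta)$, $\Gamma \neq \emptyset$. In this calculus, a derivation is constrained by
conditions on types with variables but also by conditions on $\sigma$. Making a proof
in this system thus assigns values to $\sigma$ on $X$.

3 For the examples in this presentation of the learning strategy we always use the set
of basic types $\Theta = \{e, t\}$.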

To deduce constraints on category assignments from a given Typed Example,
it is enough to prove that the sequent having the corresponding sequence of
types with variables as antecedent and $t$ as consequent is valid in a $\sigma$-sequent
calculus. Searching such proofs by backward chaining, as for parsing, provides
sets of constraints on $\sigma$. Solutions of these sets of constraints are the resulting
substitutions. If the sequence of types with variables comes from an $h$-Typed
Example (i.e. is the result of applying some $h$ to a sequence of categories which
can be proved to derive $S$ in the Lambek calculus), it is easy to see that there
exists at least one $\sigma$ such that the sequent can be proved in this $\sigma$-sequent
calculus ($\sigma$ can be obtained from $h$).

Example 6. Applied to the variabilised Typed Example of Example 5, the search
for a proof in a $\sigma$-sequent calculus gives rise to the search tree of Fig. 1, where
only the branches leading to valid proofs are displayed. In ovals are given the
constraints applying on substitutions and in rectangles the sequents obtained
after applying the rules (not themselves represented in the Figure). 7 substitutions
are obtained, among which only 5 are distinct (we have: $\sigma_2 = \sigma_3 = \sigma_4$).
They are summed up in Table 1.

Table 1. The substitutions inferred for Example 5

Variables   σ1        σ2        σ5        σ6        σ7
x0          \         \         /         \         \
x1          σ1(x3)    σ2(x5)    \         /         \
x2          \         /         \         \         \
x3          σ1(x1)    σ2(x6)    /         /         \
x4          /         \         /         /         /
x5          σ1(x6)    σ2(x1)    σ5(x6)    σ6(x6)    σ7(x6)
x6          σ1(x5)    σ2(x3)    σ5(x5)    σ6(x5)    σ7(x5)


Each substitution selects a subset of grammars associated with their Typing
Functions among the set $\mathcal{LG}_{type}$. In this sense, our algorithm is a specialisation
strategy at the set level: every new constraint reduces the search space.

But in Gold’s model, the learning function takes as input a set of examples,
not a single one. Each Typed Example is treated one after the other. At each
step are kept only the substitutions that are compatible with the new Typed
Example being analysed. The compatibility relation is a composition denoted
by $\odot$. For any substitutions $\sigma_1$ and $\sigma_2$ we define:

$$\forall u \in Types(\Theta) \cup \{/, \backslash\},\ (\sigma_1 \odot \sigma_2)(u) = u,$$

$$\forall x_i \in X,\ (\sigma_1 \odot \sigma_2)(x_i) = \begin{cases} \sigma_1(x_i) & \text{if } \sigma_1(x_i) = \sigma_2(x_i) \\ \sigma_1(x_i) & \text{if } \sigma_2(x_i) = x_i \\ \sigma_2(x_i) & \text{if } \sigma_1(x_i) = x_i \\ \text{not defined} & \text{if } \sigma_1(x_i), \sigma_2(x_i) \in \{/, \backslash\} \text{ and } \sigma_1(x_i) \neq \sigma_2(x_i) \end{cases}$$

Note that the composition $\odot$ between substitutions can be seen in terms
of the classical notion of “most general unifier” (mgu) between substitutions,
taking into account the domain of these substitutions. The composition of two
substitutions can introduce new equality constraints between variables (first line
of the definition). It does not exist when the two substitutions are not compatible
on at least one variable because substitutions are functions and thus cannot take
both values / and \ for this variable. The other conditions of the definition rely
on the fact that the set of substitutions is initialized with the Identity function
on $VarType(\Theta)$.

Example 7. Let us consider compositions between the substitutions defined in
Table 1. The composition $\sigma_1 \odot \sigma_2$ is undefined on $X$ because $\sigma_1(x_2) \neq \sigma_2(x_2)$
but $\sigma_1 \odot \sigma_6$ is defined by: $(\sigma_1 \odot \sigma_6)(x_0) = \backslash$, $(\sigma_1 \odot \sigma_6)(x_1) = \sigma_1(x_3) = /$,
$(\sigma_1 \odot \sigma_6)(x_2) = \backslash$, $(\sigma_1 \odot \sigma_6)(x_3) = \sigma_1(x_1) = /$, $(\sigma_1 \odot \sigma_6)(x_4) = /$,
$(\sigma_1 \odot \sigma_6)(x_5) = \sigma_1(x_6) = \sigma_6(x_6)$, $(\sigma_1 \odot \sigma_6)(x_6) = \sigma_1(x_5) = \sigma_6(x_5)$.
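
The composition can be sketched in Python along the lines of the “most general unifier” reading. Everything about the encoding is an assumption of this sketch rather than the authors' implementation: variables are integer indices, a substitution is a dictionary sending an index to "/", "\" or another index (an equality constraint), and a union-find forest propagates equality constraints, which the pointwise definition leaves implicit. An unresolved class of variables is represented by its least index.

```python
SLASHES = {"/", "\\"}


def compose(s1, s2):
    """Return the composition of two substitutions, or None when undefined."""
    parent = {}      # union-find forest over variable indices
    slash = {}       # class representative -> "/" or "\\" once resolved
    variables = set()

    def find(x):
        while parent.get(x, x) != x:
            x = parent[x]
        return x

    def bind(root, op):
        # A class of variables cannot take both values "/" and "\\".
        if slash.setdefault(root, op) != op:
            raise ValueError("incompatible substitutions")

    try:
        for sub in (s1, s2):
            for var, val in sub.items():
                variables.add(var)
                root = find(var)
                if val in SLASHES:
                    bind(root, val)
                    continue
                variables.add(val)
                other = find(val)
                if other != root:            # merge the two classes
                    low, high = min(root, other), max(root, other)
                    parent[high] = low
                    if high in slash:
                        bind(low, slash.pop(high))
    except ValueError:
        return None
    return {v: slash.get(find(v), find(v)) for v in sorted(variables)}


# Columns of Table 1, in the hypothetical encoding described above.
S1 = {0: "\\", 1: 3, 2: "\\", 3: 1, 4: "/", 5: 6, 6: 5}
S2 = {0: "\\", 1: 5, 2: "/", 3: 6, 4: "\\", 5: 1, 6: 3}
S6 = {0: "\\", 1: "/", 2: "\\", 3: "/", 4: "/", 5: 6, 6: 5}
```

On these inputs the sketch agrees with the example: composing the first and second columns fails on x2, while composing the first and sixth resolves x1 and x3 to / and leaves x5 and x6 equal but unresolved.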


Category Deduction. Finally, after every Typed Example has been treated,
each distinct remaining substitution will give rise to a distinct LG $G$, associated
with its specific Typing Function $h$. To obtain $G$ and $h$ from a substitution
$\sigma$, the first thing to do is to apply $\sigma$ to each type with variables appearing
at least once in the variabilised Typed Examples. A step of category deduction
is necessary because a non-basic type can derive by $h$ from a basic category
(remember $h(N) = (e,t)$). The deduction of categories is performed as
follows:

1. the type $t$, in the axiom $t \Vdash_\sigma t$, is associated with the basic category $S$;

2. every other class of distinct subtypes with variables linked by axioms $[ID]_\sigma$ in
a proof (i.e. each class of subtypes with variables equal up to the substitution
$\sigma$) is associated with a new basic category.

The definition of each Typing Function $h$ naturally follows from these definitions.

Example 8. The application of the substitution $\sigma_1$ (obtained in the leftmost
branch of Fig. 1) to the types with variables appearing in the Typed Example
of Example 5 gives the following assignments: $\langle Alain, e \rangle$, $\langle teaches, \backslash(e, x_1(e,t)) \rangle$,
$\langle lecture, x_5(e,t) \rangle$ and $\langle a, \backslash(x_1(e,t), /(x_5(e,t), t)) \rangle$. By convention, when a substitution
integrates several equality constraints between variables, its value on these
variables is the variable of least index. In the leftmost branch of Fig. 1, the axiom
$e \Vdash_\sigma e$ induces the definition of a basic category $A_1$ (with $h(A_1) = e$). The axioms
$x_1(e,t) \Vdash_\sigma x_3(e,t)$ and $x_6(e,t) \Vdash_\sigma x_5(e,t)$ induce the definitions of $A_2$ and $A_3$
respectively (with $h(A_2) = (e,t) = h(A_3)$). As long as variables $x_1$ and $x_5$ remain
distinct (we have not obtained a constraint of the form $\sigma_1(x_1) = \sigma_1(x_5)$), $A_2$ and
$A_3$ also remain distinct, which illustrates that $h$ is not necessarily injective. The
final LG obtained is: $G = \{\langle Alain, A_1 \rangle, \langle teaches, \backslash(A_1, A_2) \rangle, \langle a, \backslash(A_2, /(A_3, S)) \rangle,
\langle lecture, A_3 \rangle\}$.

Note that if a semantic distinction is made, for example, between “animated”
and “not animated” nouns, associated with two distinct types (for example $e_1$ and
$e_2$), this would lead to a distinction between two syntactic categories for common
nouns (for example some $N_1$ and $N_2$ respectively). But note also that having the
same (not basic) semantic type does not necessarily mean having the same
syntactic category: for example, common nouns and intransitive verbs are both
of type $(e,t)$, but they will nevertheless most of the time receive distinct syntactic
categories. As a matter of fact, this type will be variabilised into as many distinct
$x_i(e,t)$ as there are distinct words associated with the initial type. Then, it is expected
that the variable $x_i$ will sometimes unify with \ (for example in a sentence
like “Alain teaches”, where $(e,t)$ is the type of “teaches”), and some other times
stay unspecified because common nouns are usually not used as functors. This distinction
is enough to provide two distinct syntactic categories.

The final global inference strategy applied to a set of Typed Examples is given
in Algorithm 1.


Algorithm 1. Global inference strategy for a set of Typed Examples
Require: $TE = \{te_1, \ldots, te_n\}$ a set of Typed Examples;
 1: $U = \{Id_{VarType(\Theta)}\}$ the initial set of substitutions;
 2: for every Typed Example $te_i \in TE$ do
 3:   introduce variables respecting the condition of Sect. 3.2 to obtain $tv_i$;
 4: end for
 5: for every sequence of types with variables $tv_i$ do
 6:   prove the sequent $tv_i \Vdash_\sigma t$ in the $\sigma$-sequent calculus to obtain a set of substitutions $U_i$;
 7:   $U := \{\sigma \odot \sigma_i \mid \sigma \in U, \sigma_i \in U_i\}$;
 8: end for
 9: for every substitution $\sigma_i$ from $U$ do
10:   apply $\sigma_i$ over types with variables;
11:   apply rules (1) and (2) of deducing categories to obtain $\langle G_i, h_i \rangle$
12: end for
Ensure: $Q_r(TE) = \{\langle G_i, h_i \rangle \mid \sigma_i \in U\}$


Example of the Global Strategy. Let us apply the previous algorithm to a
pair of Typed Examples.

Alain      teaches           a                              lecture
e          (e,(e,t))         ((e,t),((e,t),t))              (e,t)
e          x0(e,x1(e,t))     x2(x3(e,t),x4(x5(e,t),t))      x6(e,t)

Isabelle   writes            a                              paper
e          (e,(e,t))         ((e,t),((e,t),t))              (e,t)
e          x7(e,x8(e,t))     x2(x3(e,t),x4(x5(e,t),t))      x9(e,t)

For this sample of Typed Examples, the variabilisation (third line of each
example) has been performed according to the constraint of Sect. 3.2. So, both
occurrences of the determiner “a” receive the same variables. After treating the
first Typed Example, the resulting substitutions are those given in Table 1.
The second Example “Isabelle writes a paper” is treated exactly the same way
as the first one and also gives rise to 5 different substitutions (replace $x_0$ by $x_7$, $x_1$
by $x_8$ and $x_6$ by $x_9$ in Table 1 to obtain their values). Among the 25 possible compositions
between one $\sigma_i$ and one of these new substitutions, only 13 are defined,
and 7 distinct. The 7 distinct LG of the following table are finally obtained:



     Alain  teaches          a                     lecture  Isabelle  writes           paper
G1   A1     \(A1,\(A1,S))    \(\(A1,S),/(A2,S))    A2       A1        \(A1,\(A1,S))    A2
G2   A1     \(A1,A2)         \(A2,/(A3,S))         A3       A1        \(A1,A2)         A3
G3   A1     \(A1,A2)         /(A3,\(A2,S))         A3       A1        \(A1,A2)         A3
G4   A1     \(A1,/(A1,S))    \(/(A1,S),/(A2,S))    A2       A1        \(A1,/(A1,S))    A2
G5   A1     \(A1,/(A1,S))    \(/(A1,S),/(A2,S))    A2       A1        /(A1,\(A1,S))    A2
G6   A1     /(A1,\(A1,S))    \(/(A1,S),/(A2,S))    A2       A1        /(A1,\(A1,S))    A2
G7   A1     /(A1,\(A1,S))    \(/(A1,S),/(A2,S))    A2       A1        \(A1,/(A1,S))    A2


Among these grammars, the last three are “real” Lambek Grammars in the 
sense that the same assignment of categories in a CG would not allow the analysis 
of at least one of the two initial sentences. 

Note that every solution grammar assigns the same basic category $A_1$ to
“Alain” and “Isabelle”. This is a direct consequence of their assignment to the
same basic type and of condition ( 2 ) of Definition |U More interestingly, every 
solution grammar also assigns the same basic category to “lecture” and “paper”. 
This results from an equality condition between the values of every final substitution
on $x_6$ and $x_9$. Fundamentally, it is because both words are introduced by
the same determiner “a”, whose type is variabilised only once. But the various
subtypes $(e,t)$ occurring in initial types gave rise to various categories, basic or
not (for example in $G_1$, $h_1(A_2) = (e,t) = h_1(\backslash(A_1, S))$). In fact, grammatical
categories can be defined as equivalence classes between subtypes that have the
same combinatorial behaviour. The substitutability at the “type with variables”
level is the criterion our algorithm uses to infer grammatical categories.

It can also be noted that the 7 grammars obtained can be reduced to 3 classes,
on the basis of their expressive power. $G_1$ is clearly apart. $G_2$ and $G_3$ are such
that, up to a renaming of basic categories, $h_2 = h_3$ and $\forall w \in \Sigma$, $\exists C_2, C_3 \in
Cat(B)$ with: $\langle w, C_2 \rangle \in G_2$, $\langle w, C_3 \rangle \in G_3$, $C_2 \vdash C_3$ and $C_3 \vdash C_2$. It is easy to
prove (using the Cut rule in the Lambek calculus, not presented here) that this
condition implies that $\forall h$, $TL_h(G_2) = TL_h(G_3)$. So, $G_2$ and $G_3$ are equivalent
with respect to the convergence criterion of learnability in the limit and it is
enough to keep memory of only one of them. The same situation occurs for the
four grammars $G_4$, $G_5$, $G_6$ and $G_7$. Unfortunately, the previous condition is only
a sufficient one for Typed Languages to be equal.

3.3 Properties of the Algorithm 

In this section, we discuss the main properties of the algorithm: it is correct 
and complete, in the sense that it is able to identify every LG without useless 
category -and up to a renaming of basic categories- and its associated Typing 
Function, compatible with a given set of Typed Examples. Nevertheless, it is not 
in itself a learning algorithm in the sense of Gold. 

Theorem 4. For every sets $\Sigma$, $B$ and $\Theta$, every $G \in \mathcal{LG}_{type}$ and every set $TE$
of Typed Examples of $G$, the result $Q_r(TE)$ of our algorithm contains every
possible couple $\langle G_i, h_i \rangle$ (up to a renaming of basic categories) such that $G_i$ is
without useless category, $h_i$ is a Typing Function from $Cat(B)$ to $Types(\Theta)$ and
$TE \subseteq TL_{h_i}(G_i)$.

The complete proof of this correctness and completeness theorem is a direct 
adaptation of the one for CG (5J. The key point consists in the following lemma: 

Lemma 3. For every $\Sigma$, $B$, $\Theta$ and $h$, every non empty finite sequence of categories
$\Gamma$ from $Cat(B)$ and every category $C \in Cat(B)$ we have: $\Gamma \vdash C$ in the
Lambek calculus if and only if there exists a substitution $\sigma$ on $VarType(\Theta)$ such
that $\Delta \Vdash_\sigma C'$ in the $\sigma$-dependent Lambek calculus, where $\Delta \in VarType(\Theta)$ is
the variabilisation of $h(\Gamma)$ and $C' \in VarType(\Theta)$ is the variabilisation of $h(C)$,
the variabilisation step being performed according to Sect. 3.2.
Note that $h$ is naturally extended to finite sequences of categories, i.e. if $\Gamma =
A_1 \ldots A_n$ then $h(\Gamma) = h(A_1) \ldots h(A_n)$.

Proof (sketch of proof of Lemma 3). Two things must be proved:

— if $\Gamma \vdash C$, we can define $\sigma$ from $h$ by using the variabilisation step (trivial);

— if $\exists \sigma$, $\Delta \Vdash_\sigma C'$ then an induction on the length of the proof must be done:

• if only an axiom $[ID]_\sigma$ is used, i.e. $\sigma(\Delta) = \sigma(C')$, then apply the category
deduction phase in Sect. 3.2: a new category $A$ is introduced, with $h(A)$
equal to the un-variabilised version of $\Delta$ and $C'$. Either $\Delta = C' \in \Theta$ and
this is consistent with Definitions] (2), either A and C' are not basic 
and $\sigma$ defines equality conditions between variables. In any case, we
have $\Gamma = C = A$ with $A \vdash A$ an axiom of the Lambek calculus.

• each time a rule in the $\sigma$-dependent Lambek calculus is used in the
proof, the corresponding rule in the Lambek calculus will be applicable
on the categories introduced by the category deduction phase. This step
of deduction also prevents the deduction of useless categories.

The main difference between the lemma and the theorem is the fact that a
real input is made of several Typed Examples and not of only one. So everything
relies on the correct definition of the composition $\odot$ between substitutions.

But the previous theorem is not enough to make our algorithm a learning
algorithm in the sense of Gold, as it does not allow selecting a unique solution
grammar, as required by the “learnability in the limit” criterion. Note that the
same problem occurred for Kanazawa’s strategy to learn CG from strings [17].
But the problem is even stronger for LG, because it is possible that $\exists \langle G_i, h_i \rangle$,
$\langle G_j, h_j \rangle \in Q_r(TE)$ such that $TL_{h_i}(G_i) \subsetneq TL_{h_j}(G_j)$ (see Example 9). This
situation never occurred for CG. When it occurs, to avoid over-generalisation,
the least general grammar $G_i$ should be chosen. It is not even known whether the
inclusion of Typed Languages is computable. This inclusion can nevertheless be
checked for every Typed Example of bounded length: this is, again, Kanazawa’s
strategy for learning CG from strings. It is computable but not tractable in
practice.
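
Example 9. Let $\Sigma = \{a, b, c, d, e, f\}$ be a vocabulary, $B = \{A, B, S\}$ the set of
basic categories and $\Theta = \{a, t\}$ the set of basic types. We define $G \in \mathcal{LG}$ by:

$G = \{\langle a, /(A,A) \rangle, \langle b, /(B,A) \rangle, \langle c, B \rangle, \langle d, A \rangle, \langle e, \backslash(A, \backslash(A,A)) \rangle, \langle f, \backslash(A, \backslash(A,S)) \rangle\}$.

$G$ is rigid, so $\forall h$, $G \in \mathcal{LG}_{type}$. Let $h$ be the following Typing Function: $h(A) = a$,
$h(B) = (a,a)$ and let the following sequence of $h$-Typed Examples:

(1) $\langle a, (a,a) \rangle \langle b, ((a,a),a) \rangle \langle c, (a,a) \rangle \langle d, a \rangle \langle e, (a,(a,a)) \rangle \langle d, a \rangle \langle f, (a,(a,t)) \rangle$.

(2) $\langle b, ((a,a),a) \rangle \langle c, (a,a) \rangle \langle b, ((a,a),a) \rangle \langle c, (a,a) \rangle \langle e, (a,(a,a)) \rangle \langle d, a \rangle \langle f, (a,(a,t)) \rangle$.

(3) $\langle d, a \rangle \langle d, a \rangle \langle d, a \rangle \langle e, (a,(a,a)) \rangle \langle f, (a,(a,t)) \rangle$.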


Applying Algorithm 1 to this set provides several solutions, among
which are the following two ones:

⟨G1,h1⟩                                          ⟨G2,h2⟩
⟨a, /(A1,A1)⟩  with h1(A1) = a                   ⟨a, /(A2,A2)⟩  with h2(A2) = a
⟨b, /(A0,A1)⟩  with h1(A0) = (a,a)               ⟨b, /(/(A2,A2),A2)⟩
⟨c, A0⟩                                          ⟨c, /(A2,A2)⟩
⟨d, A1⟩                                          ⟨d, A2⟩
⟨e, \(A1,\(A1,A1))⟩                              ⟨e, \(A2,\(A2,A2))⟩
⟨f, \(A1,\(A1,S))⟩                               ⟨f, \(A2,\(A2,S))⟩


The two grammars $G_1$ and $G_2$ are such that $TL_{h_1}(G_1) \subsetneq TL_{h_2}(G_2)$. The
inclusion of Typed Languages is obvious (every derivation in $G_1$ can be transformed
into a derivation in $G_2$, replacing $A_0$ by $/(A_2, A_2)$ and $A_1$ by $A_2$). The non
equality is exemplified by the following Typed Example, element of $TL_{h_2}(G_2)$
but not of $TL_{h_1}(G_1)$: $\langle c, (a,a) \rangle \langle d, a \rangle \langle d, a \rangle \langle f, (a,(a,t)) \rangle$.


3.4 Implementation 

We have specified the heart of our strategy (the constraint inference phase) in
terms of sequent deduction, without specifying any implementation details. In
fact, every algorithm able to perform a syntactic analysis within the Lambek
calculus can be adapted to a constraint inference algorithm and integrated at
step 6 of our Algorithm 1. It is known that the complexity of such a parsing
algorithm is at least exponential in the number of words in a sentence [25], and so is
step 6 of Algorithm 1. Furthermore, we have already explained that
the number of distinct CG compatible with a given Typed Example of $n$ words
could reach $C(n, 2n)$. This result is all the more true for LG (with the same
example): it means that for a given sequence $tv_i$ of $n$ types with variables, the
set $U_i$ may contain $C(n, 2n)$ distinct substitutions.

To implement the strategy in practice, a valid heuristic based on the Count
function defined in [28] can be used. Applied to (sequences of) categories, this
function computes the exponent of a basic category in this sequence. A necessary
condition for a sequent to be valid in the Lambek calculus is that, for every basic
category, the value of Count must be equal on each side of this sequent. This
criterion applies as well on (sequences of) types. In this case, let Count be defined
for elements of $VarType(\Theta)$ as follows:

1. $Count_\tau(\tau) = 1$, $\forall \tau \in \Theta$

2. $Count_\tau(\rho) = 0$ if $\tau \neq \rho$, $\forall \tau, \rho \in \Theta$

3. $Count_\tau(x_i(\alpha_1, \alpha_2)) = Count_\tau(\alpha_2) - Count_\tau(\alpha_1)$, $\forall x_i \in X \cup \{/, \backslash\}$ and $\forall \tau \in
\Theta$, $\forall \alpha_1, \alpha_2 \in VarType(\Theta)$.

It is naturally extended to sequences: $Count_\tau(\Gamma_1 \Gamma_2) = Count_\tau(\Gamma_1) + Count_\tau(\Gamma_2)$.
It is easy to prove that for any sequences of types with variables $\Gamma$ and $\Delta$ and
every substitution $\sigma$: if $\Gamma \Vdash_\sigma \Delta$ then $\forall \tau \in \Theta$, $Count_\tau(\Gamma) = Count_\tau(\Delta)$. This
equality can be checked in linear time and allows pruning unfruitful branches
during the search for a proof.
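
As an illustration, the Count test can be sketched in Python. The encoding is an assumption of this sketch: a basic type is a string and a type with variables is a triple (operator or variable, argument, result); the function names are hypothetical. The operator position is ignored, which is why the test applies before any variable has received a value.

```python
def count_type(tau, ty):
    """Count of the basic type tau in one type with variables."""
    if isinstance(ty, str):
        return 1 if ty == tau else 0
    _op, arg, res = ty                       # the operator or variable is irrelevant
    return count_type(tau, res) - count_type(tau, arg)


def count_sequence(tau, sequence):
    """Extension of Count to a sequence of types with variables."""
    return sum(count_type(tau, ty) for ty in sequence)


def passes_count_test(antecedent, consequent, basic_types=("e", "t")):
    """Necessary (not sufficient) condition for the sequent to be provable."""
    return all(count_sequence(tau, antecedent) == count_type(tau, consequent)
               for tau in basic_types)


# "Alain teaches a lecture" after variabilisation.
TV = ["e",
      ("x0", "e", ("x1", "e", "t")),
      ("x2", ("x3", "e", "t"), ("x4", ("x5", "e", "t"), "t")),
      ("x6", "e", "t")]
```

The full sequence balances against the consequent t, whereas dropping the verb leaves two unmatched occurrences of e, so that branch can be pruned without any proof search.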

3.5 Extensions 

It remains to see how our learning strategy could be adapted to learn the class
$\mathcal{LG}^m_{type}$ (see Definition 10) from Typed Examples. Two approaches are possible.

The first one consists in changing the variabilisation phase, by giving up the
condition of Sect. 3.2. In this case, all introduced variables are distinct and
the phase of constraint inference is performed as described. But the category
deduction phase must be modified, to take into account the fact that every
couple (word, category) can give rise to at most $m$ distinct couples (word, type).

The second possible approach starts by preserving the variabilisation phase as
described in Sect. 3.2 and the constraint deduction phase, but allowing
substitutions to be relations rather than functions (allowing different possible
values for some $\sigma(x)$). The definition of the composition $\odot$ between substitutions
in Sect. 3.2 is thus changed, to allow at most $m$ distinct values for every set of
variables appearing in a unique type. For example, if the target LG belongs
to $\mathcal{LG}^2_{type}$, the set of variables $\{\sigma(x_2), \sigma(x_3), \sigma(x_4), \sigma(x_5)\}$ (i.e. the variables
introduced in the type of the determiner “a”) is allowed to have two different
extensions. The category deduction phase is then not modified.

We have not precisely proved the completeness and validity of these approaches
but they are natural extensions of the basic one. Whichever is chosen, the new
algorithm has an exponentially higher complexity than the previous one.


4 Conclusion 

Before we started our study, positive learnability results in Gold’s model for Lam- 
bek Grammars (LG) were still rare and only concerned rigid LG |614) . We present 
here a result of learnability for larger classes of LG, provided that adapted data 
are available. The advantages of our strategy are very similar to the ones we 
already argued for Classical Categorial Grammars (CG) [10,11]: types are lexicalized
information, easier to justify than structural information. Furthermore
the learnability from Typed Examples is even more relevant for LG than for CG.
As a matter of fact, the types exemplified in this paper are those of the classical
logical formulas associated with words in Montague’s tradition. The logical
translation of the determiner “a” is classically $\lambda P \lambda Q \exists x [P(x) \wedge Q(x)]$, the one of
the quantifier “every” is $\lambda P \lambda Q \forall x [P(x) \Rightarrow Q(x)]$, both of type $((e,t),((e,t),t))$.
With this typing, no CG is compatible with the Typed Example corresponding
with “Every man loves a woman”, whereas some LG are.

But the problem remains difficult and leads to non tractable strategies, particularly
for the most general case of grammars in $\mathcal{LG}^m_{type}$. Nevertheless, despite
bad theoretical complexity, the algorithm for $\mathcal{LG}_{type}$ has been programmed and
successfully applied on small sets of natural language sentences [S] (built for 
this purpose and not extracted from a real corpus). The specialisation strategy 
it implements is recognized by psycholinguists as more cognitively relevant than
generalisation strategies.


Acknowledgements 

This paper is an extended version of another one written for the conference 

“Categorial Grammar” in 2004. The implementation of the algorithm was due 

to Frederic Dupont. We also thank referees for their useful comments. 

References 

1. Angluin, D.: Inductive inference of formal languages from positive data. Informa¬ 
tion and Control 45, 117-135 (1980) 

2. Bar Hillel, Y., Gaifman, C., Shamir, E.: On Categorial and Phrase Structure Gram¬ 
mars. Bulletin of the Research Council of Israel 9F (1960) 

3. Bechet, D., Bonato, R., Dikovsky, A., Foret, A., Le Nir, Y., Moreau, E., Retore, 
C., Tellier, I.: Modeles algorithmiques de l’acquisition de la syntaxe; concepts et 
methodes, resultats et problemes, Revue Linguistique de Vincennes, pp. 123-152 
(2007) 

4. Bechet, D., Foret, A.: Apprentissage des grammaires de Lambek rigides et d’arite 
bornee pour le traitement automatique des langues. In: CAP 2003, pp. 155-168 
(2003) 

5. Bechet, D., Foret, A., Tellier, I.: Learnability of Pregroup Grammars. Studia Log- 
ica 87, 225-252 (2007) 

6. Bonato, R., Retore, C.: Learning rigid Lambek grammars and minimalist gram¬ 
mars from structured sentences. In: Proceedings of Learning Language in Logic 
Workshop (LLL), pp. 23-34 (2001) 

7. Buszkowski, W., Penn, G.: Categorial grammars determined from linguistic data by
unification. Studia Logica, 431-454 (1990)

8. Dudau-Sofronie, D.: Apprentissage de grammaires categorielles pour simuler
l’acquisition du langage naturel a l’aide d’informations semantiques, PhD thesis,
Universite de Lille (2004)

9. Dupont, F.: Apprentissage de grammaires de Lambek a partir de types. In: Memoire
DEA d’informatique de Lille 1 (2003)

10. Dudau-Sofronie, D., Tellier, I., Tommasi, M.: From logic to grammars via types. 
In: Proceedings of Learning Language in Logic Workshop (LLL), pp. 35-46 (2001) 

11. Dudau-Sofronie, D., Tellier, I., Tommasi, M.: A learnable class of Classical Cate¬ 
gorial Grammars from typed examples. In: Proceedings of the 8th Conference on 
Formal Grammar, pp. 77-88 (2003) 



12. Foret, A., Le Nir, Y.: On Limit Points for Some Variants of Rigid Lambek Gram¬ 
mars. In: Adriaans, P.W., Fernau, H., van Zaanen, M. (eds.) ICGI 2002. LNCS 
(LNAI), vol. 2484, pp. 106-119. Springer, Heidelberg (2002) 

13. Fulop, S.: On the Logic and Learning of Language. Trafford Inc., Canada (2004) 

14. Gold, E.M.: Language identification in the limit. Information and Control 10, 
447-474 (1967) 

15. Houde, O.: Rationnalite, developpement et inhibition. Presses Universitaires de
France (1998)

16. Janssen, T.M.V.: Compositionality. In: Handbook of Logic and Language, pp. 
417-473. MIT Press, Cambridge (1997) 

17. Kanazawa, M.: Learnable Classes of Categorial Grammars. CSLI Publications, 
Stanford (1998) 

18. Lambek, J.: The mathematics of sentence structure, vol. (65), pp. 154-170 (1958)

19. Montague, R.: Formal Philosophy; Selected papers of Richard Montague (1974) 

20. Moortgat, M.: Categorial type logics. In: Handbook of Logic and Language. MIT 
Press, Cambridge (1997) 

21. Oehrle, R.T., Bach, E., Wheeler, D.: Categorial grammars and natural language
structures. Reidel, Dordrecht (1988)

22. Partee, B.: Mathematical methods in Linguistics, vol. (30) (1990) 

23. Pentus, M.: Lambek calculus is NP-complete, Draft (2003)

24. Pinker, S.: Language Acquisition. In: An Invitation to Cognitive Science, pp. 
135-182. MIT Press, Cambridge (2005) 

25. Retore, C.: The logic of categorial grammars. In: ACL 2001 and Rapport Inria 
5703 (2003) 

26. Tellier, I.: Modeliser l’acquisition de la syntaxe via l’hypothese de la primaute du 
sens, Habilitation thesis, university of Lille3 (2005) 

27. Tellier, I.: How to Split Recursive Automata. In: Clark, A., Coste, F., Miclet, 
L. (eds.) ICGI 2008. LNCS (LNAI), vol. 5278, pp. 200-212. Springer, Heidelberg 
(2008) 

28. van Benthem, J.: The Lambek Calculus. In: Categorial Grammars and Natural 
Language Structures, pp. 35-68. Reidel, Dordrecht (1988) 



Dialogues in Ludics 


Marie-Renée Fleury¹, Myriam Quatrini¹, and Samuel Tronçon²


¹ Institut de Mathématiques de Luminy, Aix-Marseille Université
² Laboratoire “Structures Formelles du Langage”, Université Paris 8

fleury@lumimath.univ-mrs.fr, {quatrini, troncon}@iml.univ-mrs.fr


Abstract. In this paper we defend the following claim: Ludics is a
relevant framework both for formalising dialogues and for studying them
in a new way. First we informally introduce a notion of dialogue and
explain its correspondence with some fundamental concepts of Ludics.
Then we give a light technical presentation of Ludics, focusing on the
points most relevant to the study of formal dialogues: objects, actions
and interactions. Finally, we present the concrete part of the model
with some examples of dialogues in Ludics.


Introduction 

Recent advances in mathematical logic seem pertinent for renewing the logical
anchorage of language studies, from both the philosophical point of view and
that of formalisation. Ludics is a new logical theory, due to J.-Y. Girard [8],
which arises in Proof Theory by postulating interaction as the most primitive
object, from which the usual logical concepts (formulas, proofs) may be
recovered. We think that this reversal may also be fruitful in domains other
than mathematical logic and Computation Theory. Our aim is to understand
the relevance of the new concepts of Ludics to linguistics, with applications
to the formalisation of concepts in this domain. Ludics can easily be related
to pragmatics, and more generally to the semantics of natural language. However,
this link is not established in the usual way. Most known results in this
domain are simply formalisations, i.e. embeddings of semantic constructions
in some formal language. Ludics instead opens the way to a very special kind of
representation, presaged by the perspectives of Linear Logic, which takes account
of both logical and linguistic internal specificities. This preservation of their
respective properties is probably due to the geometrical nature of logic and
the cognitive elaboration of language. We can therefore expect from this new
paradigm the development of a conceptual correspondence providing a theoretical
framework in which both the engineering and the philosophy of meaning would be
improved.

Here we focus on dialogues, and we sketch a pattern of ludical formalisation
of dialogues. This approach, which investigates the dynamics of interactive
situations, offers rich intuitions and mathematical insights that are easily
linked to the natural structure of our object. In this way, we can further
analyse the interactive layer of dialogues, leaving aside both the propositional
and the constructive ones¹. We expect that such a new perspective may be useful
for filling out the current models of dialogue. Nevertheless, the precise
relation between our analytical approach and previous work on dialogues deserves
further examination.

S. Pogodalla, M. Quatrini, and C. Retoré (Eds.): Lecomte Festschrift, LNAI 6700, pp. 138-157, 2011.
© Springer-Verlag Berlin Heidelberg 2011

The text is organised as follows. In the first section, we introduce our model
by presenting an informal notion of dialogue and by explaining its correspondence
with some core concepts of Ludics. These remarks will be useful for understanding
the continuity between intuitive dialogue (especially seen from the point of view
of meaning construction) and formal dialogues. In the second section we give a
light technical presentation of Ludics, focusing on the points most relevant to
the study of formal dialogues: objects, actions and interactions. Finally, we
present the concrete part of the model with some examples of dialogues in Ludics.
We begin with an elementary decomposition that matches interventions with actions.
Secondly, we introduce a higher level of formalisation, a refined approach to the
same object, by viewing interventions as complex actions. Finally, we open the
level of inside relations, considering superstructures and recursive dialogues.

1 Preliminary Remarks 

Intuitively, we can define dialogue as a common search carried out by speakers
who want to establish some knowledge by exploring the possibilities opened by
some thesis and its counter-thesis. This intuitive notion of dialogue corresponds
in some way to the Greek concept of dialegesthai (Gr. διαλέγεσθαι), which is a
primitive notion in the philosophy of knowledge, as opposed to the more
traditional dialectic.

We describe dialogues as sequences of polarized actions which constitute the 
chronology of symbolic exchanges underlying the communication of meanings. 
Their structure represents both the research itself, seen as a process in progress, 
and the knowledge itself, seen as a stable object, finite at one step of development. 

The dialogue carries out three essential functions: exchange of information,
construction of knowledge, and resolution of a cognitive tension.

First, at every stage, a speaker gives a symbol, and this exchange is
informative in three ways: it informs us about the object discussed (some thesis),
the subject who is speaking (his approach to this thesis), and the connection
between the present intervention and some counter-interventions (upstream
or downstream, actual or virtual).

Second, a running dialogue shows arguments interacting like machines built
to explore relevant opportunities of discussion according to some global strategy:
I argue in this way to reach this point, I open these branches to induce some
reactions... So dialogue is a sort of unfolding structure which represents some
knowledge. Evidently, involving friendly but tenacious interlocutors ensures a
good (exhaustive) exploration.

Third, through the interaction, the speakers can extract new information
about the shape of the interaction, contained in the result of the dialogue:
what is stable, what is explored, what is new, what is latent.


¹ Here “constructive” refers to the way the arguments are logically articulated.

Interpreting Ludics as a paradigmatic level which shows natural dynamics in
logic, we can find some correspondences between the functions of dialogue and
properties of the logical world described in Ludics.

1.1 Action Dynamics 

First, we must observe that abstract identity (the same name refers to the same 
content) is replaced in Ludics by a concrete identity based, as in a game, on a 
behavioral criterion requiring concrete observation: 

the reactions experienced by a player correspond
to the actions of his opponent.

Consequently, a logical entity a is characterized by the set of all its interactions 
with the rest of the logical world. And a is identified with a' when they interact 
exactly in the same manner with this world. This idea is very close to the fact that 
conceptual identity does not exist between two natural terms or two sentences, 
since it is only in the consideration of the context, and so in a definite situation 
of interaction, that we can evaluate a semantic item. 

This property, well known as holistic evaluation in the semantics of natural
language, is produced by the localization constraint. In the old logical style
we had different occurrences of the same content. Context-sensitive logics, like
linear logic, introduce occurrences linked by an orthogonality relation,
permitting strict resource management. Through localization, occurrences become
locations (loci) in a geometrical framework and, necessarily, distinct locations
are considered as different.

So we have a first notion of meaning: the set of all the possible
interactions with other elements of the logical world is the conceptual
meaning of my strategy, since interactions explain what can be expected
from my actions. The reaction of my opponent is the meaning of my own
action: it is induced by actions I have performed, and it induces reactions by
me which open or close possible moves. The first step of our model will give
sense to this intuition by setting the following elementary decomposition: one
intervention is treated as a ludic action, and actions mirror each other.

1.2 Exploration Dynamics 

We can also refer to locations in two ways, depending on the position of the
cognitive subject: consumer vs. producer, sender vs. receiver, speaker vs.
addressee, etc. In linguistics this point corresponds to the fact that duality
is not an objective reality but a local opposition between persons who can
change their positions. Even if we know (at least conceptually) that strategies
interact with all the strategies in the world, there is a very special
relationship between a set of strategies, its counter-strategies (a sort of
optimal opponent which explores all the determinations of the player’s
intentions) and the counter-strategies of these counter-strategies, that is to
say the initial set.

So we have a second notion of meaning. My strategies are the meaning
of the strategies of my opponent, since he made choices among all the playable
strategies in order to select one which can play optimally against mine. Natural
language sentences are not considered by themselves but with regard to the
interpretation processes they suggest. There is a construction of mine, its
deconstruction by my opponent, a re-construction knowing that he knows what I
said, and his re-deconstruction knowing that I know that he knows...

The second step of our model will focus on the explorative part of
interventions, which are made in order to explore some thesis by anticipating
the possible paths which might be chosen by an opponent. This will give a second
level of formalisation, considering interventions as more complex structures: an
intervention is a whole strategy, made of anticipations (about the opponent) and
forecasts (about my own further plays).

1.3 Inside Dynamics 

Geometrical logics, of which Ludics is a very representative one, are founded on
the dynamics of the calculus. This is the crucial point in our attempt,
considering that too many theories in this domain neglect to develop a really
good dynamic part. Until now, we have known very expressive systems, based on a
lambda-calculus, but without specific attention to the β-reduction and
η-expansion processes. Here, on the contrary, we consider dynamics as the
essential part of our research. According to this principle, we propose a third
notion of meaning: the meaning of our common search is the form of our
interaction itself. If we diverge, it means that no durable connection is
possible between our games. But we explore, by means of this divergence, some
determinations of the invoked strategies, and we would be able, in future games,
to modify our intentions in order to explore the offered possibilities more
fully. In the case of convergence, a really useful exploration is always
possible, which produces new knowledge, not explicitly present in our past
determinations. Useful interactions are interactions which urge us to
explore more deeply the determinations of our playing intentions.

Considering dynamics, we must introduce a level of complexity, presented in
the third part of the model. Inside the ongoing dialogue, we must have the
capability of duplicating some modules, invoking strategies used in previous
stages and dialogues, inserting dead-ends and closed loops, cheating in the
argument tree, or disrupting the opponent’s plans. These are complex processes
of de- and re-localization inside the running dialogue, and with outside parts
turned inside out, as a sort of high-level transfer-of-training mechanism.

Lastly, let us make clear that there are no logical propositions in this model
of dialogue. In a more global approach to communication, we suppose a kind of
multilevel structure, each level corresponding to a way of taking the object
into account. The most abstract level is concerned with the propositional
aspects of communication, and at the most primitive level we find the
interactive structure of dialogue itself. The work in the present text is
clearly based on the latter. Fortunately, it would be possible to invoke some
principle of compilation between levels which ensures continuity and restores
the connection between abstract and concrete views. But this is another work.

Here, we do not use propositional analysis as in the syntactic style, and the
structures we present are not based on conceptual contents as in the semantic
styles, because we want to move our model away from the semantic/syntactic
duality. We propose a third way, based on the exchange itself, considering the
dialogue as a sequence of interactions in some place at some time. So a locus
would be a place and a moment of dialogue, and not a linguistic object
considered in terms of its own grammatical structure.


2 Ludics in a Nutshell 

Ludics can be summed up as an interaction theory. It appears in the work of
J.-Y. Girard [6] as the outcome of several changes of paradigm in “Proof
Theory”²: from provability to computation, then from computation to interaction.
The first change of paradigm arises with intuitionistic logic, while the second
is due to the development of linear logic. Continuing the new approaches of
Linear Logic (a geometrical point of view on proofs, an internal approach to
dynamics), Ludics focuses on interaction.

The objects of Ludics are no longer proofs but incomplete proofs, attempts at
proofs. A rule called daimon is therefore available to symbolize giving up in a
proof search, or a pending lemma. These objects play the role of a proof
architecture: only what is needed for the interaction is kept. This has
been made possible by the hypersequentialized linear logic introduced
by J.-M. Andreoli after he discovered the polarity of formulas. Moreover,
working within the world of polarized objects made it possible to establish a
link [3] between Ludics and recent work in Game Semantics, which shares similar
motivations. Game Theory is thus a good metaphor for a first approach³ to
Ludics, and it is the point of view often followed in this text.

2.1 The Objects of Ludics 

The central object of Ludics is the design. By means of the metaphor of games,
a design can be understood as a strategy, i.e. as a set of plays (chronicles)
ending with answers of Player to the moves planned by Opponent. The plays are
alternating sequences of moves (actions). A move is defined as a triple
consisting of: first, a polarity (positive for a move of Player, negative for a
move of Opponent); second, a locus (a fixed position) at which the move is
anchored; and last, a finite number of positions reachable in one step (the
ramification). A special positive move is also possible: the daimon.


² A deep presentation of the philosophical and epistemological point of view on
the progress of mathematical logic (where computer science is concerned) can be
found in the works of J.-B. Joinet and S. Tronçon (see the references).

³ Here we give a presentation of the theory in a very simplified version; we
recommend the source texts (see the references) to the reader interested in more
details on the mathematical notions and rich concepts of Ludics. We also
recommend reading the introduction [2].

In Ludics, the positions are addresses, loci encoded by means of a finite
sequence of integers (often denoted ξ, ρ, σ, ...).
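As an illustration of loci as addresses, here is a minimal Python sketch. It is not part of the original presentation: the representation of a locus as a tuple of integers relative to a named base, and the helper names `sub_loci`, `is_sub_locus` and `show`, are assumptions made for the example.

```python
from typing import FrozenSet, List, Tuple

# Assumption: a locus is stored as the tuple of integers that follows a
# base address such as ξ, so ξ itself is the empty tuple and ξ.0.3 is (0, 3).
Locus = Tuple[int, ...]


def sub_loci(locus: Locus, ramification: FrozenSet[int]) -> List[Locus]:
    """Loci reachable in one step from `locus` through `ramification`."""
    return [locus + (i,) for i in sorted(ramification)]


def is_sub_locus(candidate: Locus, locus: Locus) -> bool:
    """True when `locus` is a prefix of `candidate`."""
    return candidate[: len(locus)] == locus


def show(locus: Locus, base: str = "ξ") -> str:
    """Pretty-print a locus relative to a named base address."""
    return ".".join([base] + [str(i) for i in locus])
```

With this encoding, an action anchored at ξ.0 with ramification {1, 3} creates the loci ξ.0.1 and ξ.0.3, i.e. `sub_loci((0,), frozenset({1, 3}))`.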

The starting positions (forks) are denoted Γ ⊢ Δ, where Γ and Δ are finite
sets of loci such that Γ is either the empty set or a singleton. When an
element belongs to Γ, every play starts on this element with an Opponent move
(and the fork is said to be negative); otherwise Player starts on an element
of his choice taken in Δ (and the fork is said to be positive).

From the point of view of hypersequentialized linear logic, a design can be seen
as a proof figure in this sequent calculus with some particularities: first,
we can use the daimon rule for giving up the proof search; secondly, we do not
work with formulas but with addresses; lastly, two rules are sufficient to
subsume the usual rules for logical connectives⁴.

Definition 1. A design is a tree of forks Γ ⊢ Δ, built by means of these three
rules:

— Daimon

$$
\dfrac{}{\vdash \Delta}\ \dagger
$$

— Positive rule

$$
\dfrac{\cdots \quad \xi.i \vdash \Delta_i \quad \cdots}{\vdash \Delta,\ \xi}\ (+,\xi,I)
$$

where I is a possibly empty ramification such that for every pair of distinct
indices i, j ∈ I, Δ_i and Δ_j are disjoint, and every Δ_i is included⁵ in Δ.

— Negative rule

$$
\dfrac{\cdots \quad \vdash \xi.I,\ \Delta_I \quad \cdots}{\xi \vdash \Delta}\ (-,\xi,\mathcal{N})
$$

where 𝒩 is a possibly empty or infinite set of ramifications such that for all
I ∈ 𝒩, Δ_I is included in Δ.


Example 1 (Faggian-Maurel contract in Ludics). Bob offers the following
contract to Alice: give me one euro and you can choose either a book or a
surprise. In the latter case, I will give you a CD or a DVD.⁶

⁴ This is a direct consequence of the focalization property: a proof of a
formula in usual linear logic can be replaced with a proof of an equivalent
formula written by means of polarized synthetic connectives. So only two rule
schemes (positive and negative) are needed.

⁵ Every rule where the union of the Δ_i is strictly included in Δ corresponds to
the weakening rule (respectively, for the negative rule, when Δ_I is strictly
included in Δ).

⁶ This contract can be described by means of the following Linear Logic formula:

1 euro ⊸ (1 book & (1 CD ⊕ 1 DVD))

We are going to represent by means of designs Alice’s and Bob’s strategies
for the dialogue based on this contract. In the next section we shall study
this contract in progress by studying the effect of the interaction between these
designs. We arbitrarily start the interaction at locus ξ.

Two Strategies of Bob in Ludics. The two strategies begin in the same manner:
first Bob sets out the contract (represented by the positive action (+, ξ, {0}));
then he is ready to receive one euro and give a book (represented by the negative
action (−, ξ.0, {1, 2})), and he is also ready to receive one euro and choose a
surprise for Alice ((−, ξ.0, {1, 3})).

In the first branch, Bob ends the exchange by giving the book in return for
one euro; in Ludics, we say that he plays the daimon. Playing the daimon
in a strategy allows us to attest that the exchange ends correctly⁷. In the
second branch, Bob gives a surprise in return for one euro.

The two strategies then differ according to whether Bob gives a CD or a DVD.
“Bob chooses the CD” is represented by the positive action (+, ξ.0.3, {1}), while
“Bob chooses the DVD” is represented by the positive action (+, ξ.0.3, {2}).

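$$
\dfrac{\dfrac{\dfrac{}{\vdash \xi.0.1,\ \xi.0.2}\,\dagger \qquad \dfrac{\xi.0.3.1 \vdash \xi.0.1}{\vdash \xi.0.1,\ \xi.0.3}}{\xi.0 \vdash}}{\vdash \xi}
\qquad\qquad
\dfrac{\dfrac{\dfrac{}{\vdash \xi.0.1,\ \xi.0.2}\,\dagger \qquad \dfrac{\xi.0.3.2 \vdash \xi.0.1}{\vdash \xi.0.1,\ \xi.0.3}}{\xi.0 \vdash}}{\vdash \xi}
$$

Fig. 1. Two strategies of Bob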


Three Strategies of Alice in Ludics. The three strategies begin in the same
manner: she listens to the contract (she plays the negative action (−, ξ, {0})).

— In the first strategy, “she gives one euro and chooses a book” is represented
by the positive action (+, ξ.0, {1, 2}); in the second and third strategies,
“she gives one euro and chooses a surprise” is represented by the positive
action (+, ξ.0, {1, 3}).

— In the second strategy, she chooses a surprise, so she is ready for both
eventualities: receiving a CD or receiving a DVD. This is represented by the
two negative actions (−, ξ.0.3, {1}) and (−, ξ.0.3, {2}). In both cases, she
ends the exchange with the positive action † (great, thank you).

— In the third strategy, she chooses a surprise, but she is only ready to
receive a CD; this is represented by the negative action (−, ξ.0.3, {1}). In
this case, she ends the exchange with the positive action † (fine, thank you).

Remark 1. Let us notice that receiving one euro and giving one euro are
represented by the same action. This illustrates, indirectly, the fact that, in
Ludics, negation only consists in exchanging the point of view.

⁷ Another possibility would be to make Bob play an action (“Bob gives Alice a
book”) and then make Alice play the daimon (“Alice thanks Bob”).

$$
\dfrac{\dfrac{\xi.0.1 \vdash \qquad \xi.0.2 \vdash}{\vdash \xi.0}}{\xi \vdash}
\qquad
\dfrac{\dfrac{\xi.0.1 \vdash \qquad \dfrac{\dfrac{}{\vdash \xi.0.3.1}\,\dagger \quad \dfrac{}{\vdash \xi.0.3.2}\,\dagger}{\xi.0.3 \vdash}}{\vdash \xi.0}}{\xi \vdash}
\qquad
\dfrac{\dfrac{\xi.0.1 \vdash \qquad \dfrac{\dfrac{}{\vdash \xi.0.3.1}\,\dagger}{\xi.0.3 \vdash}}{\vdash \xi.0}}{\xi \vdash}
$$

Fig. 2. Three strategies of Alice


2.2 The Interaction 

The designs are built on the model of cut-free proofs. The underlying meaning
of the cut is the composition of morphisms or strategies. In Ludics, it is
concretely translated by the coincidence of two loci in dual position in the
bases of two designs. We can for example cut a design of base σ ⊢ ξ and a design
of base ξ ⊢ ρ, thus forming a cut-net⁸ of base σ ⊢ ρ.

The interaction is obtained by means of the cut; it creates a dynamics of
rewriting of the cut-net. The process continues as long as we can find a negative
action corresponding to the current positive one, until a daimon is played. When
no such negative action exists, the process fails. Otherwise we obtain a design
with the same base as the starting cut-net.

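Example 2. The interaction between the second strategy of Alice and the first
one of Bob. The two designs are cut on ξ:

$$
\dfrac{\dfrac{\xi.0.1 \vdash \qquad \dfrac{\dfrac{}{\vdash \xi.0.3.1}\,\dagger \quad \dfrac{}{\vdash \xi.0.3.2}\,\dagger}{\xi.0.3 \vdash}}{\vdash \xi.0}}{\xi \vdash}
\qquad\qquad
\dfrac{\dfrac{\dfrac{}{\vdash \xi.0.1,\ \xi.0.2}\,\dagger \qquad \dfrac{\xi.0.3.1 \vdash \xi.0.1}{\vdash \xi.0.1,\ \xi.0.3}}{\xi.0 \vdash}}{\vdash \xi}
$$

After the first reduction step we get (cut on ξ.0):

$$
\dfrac{\xi.0.1 \vdash \qquad \dfrac{\dfrac{}{\vdash \xi.0.3.1}\,\dagger \quad \dfrac{}{\vdash \xi.0.3.2}\,\dagger}{\xi.0.3 \vdash}}{\vdash \xi.0}
\qquad\qquad
\dfrac{\dfrac{}{\vdash \xi.0.1,\ \xi.0.2}\,\dagger \qquad \dfrac{\xi.0.3.1 \vdash \xi.0.1}{\vdash \xi.0.1,\ \xi.0.3}}{\xi.0 \vdash}
$$

After the second reduction step we get (cuts on ξ.0.1 and ξ.0.3):

$$
\xi.0.1 \vdash
\qquad
\dfrac{\xi.0.3.1 \vdash \xi.0.1}{\vdash \xi.0.1,\ \xi.0.3}
\qquad
\dfrac{\dfrac{}{\vdash \xi.0.3.1}\,\dagger \quad \dfrac{}{\vdash \xi.0.3.2}\,\dagger}{\xi.0.3 \vdash}
$$

For the next step, we first choose⁹ the cut on ξ.0.3, then we get (cuts on
ξ.0.3.1 and ξ.0.1):

$$
\dfrac{}{\vdash \xi.0.3.1}\,\dagger
\qquad
\xi.0.3.1 \vdash \xi.0.1
\qquad
\xi.0.1 \vdash
$$

which is reduced to (cut on ξ.0.1):

$$
\dfrac{}{\vdash \xi.0.1}\,\dagger
\qquad
\xi.0.1 \vdash
$$

The reduction ends up on Dai⁺.

⁸ A cut-net is a finite graph of designs whose bases are pairwise connected by
cuts; the cut-net is connected and acyclic. The base of the cut-net is obtained
by erasing the cut loci.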

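Example 3. The interaction between the third strategy of Alice and the second
one of Bob. The two designs are cut on ξ:

$$
\dfrac{\dfrac{\xi.0.1 \vdash \qquad \dfrac{\dfrac{}{\vdash \xi.0.3.1}\,\dagger}{\xi.0.3 \vdash}}{\vdash \xi.0}}{\xi \vdash}
\qquad\qquad
\dfrac{\dfrac{\dfrac{}{\vdash \xi.0.1,\ \xi.0.2}\,\dagger \qquad \dfrac{\xi.0.3.2 \vdash \xi.0.1}{\vdash \xi.0.1,\ \xi.0.3}}{\xi.0 \vdash}}{\vdash \xi}
$$

As previously, after two reduction steps we get (cuts on ξ.0.1 and ξ.0.3):

$$
\xi.0.1 \vdash
\qquad
\dfrac{\xi.0.3.2 \vdash \xi.0.1}{\vdash \xi.0.1,\ \xi.0.3}\,(\xi.0.3,\{2\})
\qquad
\dfrac{\dfrac{}{\vdash \xi.0.3.1}\,\dagger}{\xi.0.3 \vdash}\,(\xi.0.3,\{\{1\}\})
$$

The next step produces a failure because {2} ∉ {{1}}.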
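The two interactions above can be replayed with a small Python sketch. This is an illustrative simplification, not the authors' formal normalisation procedure: it assumes finite designs, a closed cut-net made of exactly two designs, and an ad hoc tree encoding (the helpers `pos`, `neg`, `interact`, `bob` and `alice` are hypothetical names, and ξ is encoded as the empty address).

```python
DAIMON = "†"
XI = ()  # assumption: the starting locus ξ is encoded as the empty address


def pos(locus, ramification, subs=None):
    """Positive action (+, locus, ramification).

    `subs` maps each created sub-locus to the negative node waiting there.
    """
    return (tuple(locus), frozenset(ramification), subs or {})


def neg(branches=None):
    """Negative node: maps each accepted ramification to a positive design."""
    return {frozenset(r): d for r, d in (branches or {}).items()}


def interact(positive_design, negative_locus, negative_node):
    """Follow the interaction until a daimon (success) or a mismatch (failure).

    Returns the list of positive actions visited and a convergence flag.
    """
    waiting = {tuple(negative_locus): negative_node}  # negative nodes by locus
    current = positive_design
    trace = []
    while current != DAIMON:
        locus, ramification, subs = current
        trace.append((locus, ramification))
        branches = waiting.get(locus, {})
        if ramification not in branches:
            return trace, False       # no matching negative action: failure
        waiting.update(subs)          # sub-designs now wait on their loci
        current = branches[ramification]
    trace.append(DAIMON)
    return trace, True


def bob(choice):
    """Bob's strategy: choice=1 gives the CD, choice=2 gives the DVD."""
    return pos(XI, {0}, {
        (0,): neg({                   # negative node at locus ξ.0
            (1, 2): DAIMON,           # one euro for a book
            (1, 3): pos((0, 3), {choice}, {(0, 3, choice): neg()}),
        })
    })


def alice(accepted):
    """Alice chooses the surprise and accepts the items listed in `accepted`."""
    return neg({
        (0,): pos((0,), {1, 3}, {     # she answers the ramification {0} of ξ
            (0, 1): neg(),
            (0, 3): neg({(i,): DAIMON for i in accepted}),
        })
    })


example_2 = interact(bob(1), XI, alice({1, 2}))   # converges on the daimon
example_3 = interact(bob(2), XI, alice({1}))      # fails: {2} is not accepted
```

Under these assumptions, `example_2` ends with the daimon after visiting the actions at ξ, ξ.0 and ξ.0.3, while `example_3` stops at ξ.0.3 because Alice's negative node only contains the ramification {1}.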

Example 4. Crucial normalisation: a design against the Fax.

The Fax is the following design, which is recursively defined (where 𝒫_f(ℕ) is
the set of finite subsets of ℕ):

$$
\mathrm{Fax}_{\xi,\xi'} \;=\;
\dfrac{\cdots \quad \dfrac{\cdots \quad \dfrac{\mathrm{Fax}_{\xi'.i,\,\xi.i}}{\xi'.i \vdash \xi.i} \quad \cdots}{\vdash \xi.I,\ \xi'}\,(+,\xi',I) \quad \cdots}{\xi \vdash \xi'}\,(-,\xi,\mathcal{P}_f(\mathbb{N}))
$$

The normalization between this design and a design of base ⊢ ξ is a copy
of the original design, except that in the whole design the locus ξ has been
replaced with the locus ξ', which justifies its name: “Fax”.
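The effect of normalising against the Fax can be pictured by the following Python sketch. It only computes the expected result (the relocated copy), not the normalisation itself, and it works on a single chronicle rather than on a whole design; the function name `delocalize` and the encoding of loci as integer tuples are assumptions made for the illustration.

```python
def delocalize(chronicle, source, target):
    """Copy a chronicle, replacing the address prefix `source` by `target`.

    Each action is a triple (polarity, locus, ramification), where a locus
    is a tuple of integers.
    """
    moved = []
    for polarity, locus, ramification in chronicle:
        if locus[: len(source)] != source:
            raise ValueError(f"locus {locus} does not lie under {source}")
        moved.append((polarity, target + locus[len(source):], ramification))
    return moved
```

For instance, if ξ is encoded as `(1,)` and ξ' as `(2,)`, the chronicle (+, ξ, {0}) (−, ξ.0, {1, 3}) is copied to (+, ξ', {0}) (−, ξ'.0, {1, 3}).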

⁹ The order chosen for executing the reduction steps is relevant in some
rewriting systems, but this is not the case in Ludics, due to the separation
theorem established

Dispute. In Ludics, the notion of dispute allows us to report the sequence of
the moves (actions) of the play (interaction between two designs connected by a
cut), from the point of view of one of the speakers. For example, in the first
scenario of the foregoing example (the FM-contract between Alice and Bob), the
dispute, from the point of view of Alice, is:

(−, ξ, {0}), (+, ξ.0, {1, 3}), (−, ξ.0.3, {1}), †
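The change of point of view can be sketched in Python as follows. This is an illustration under simple assumptions: the moves are given in the order played, the two speakers strictly alternate, and the function name `dispute` and its `viewpoint_starts` flag are hypothetical.

```python
def dispute(moves, viewpoint_starts):
    """Render an alternating sequence of moves from one speaker's viewpoint.

    `moves` lists (locus, ramification) pairs in the order played, optionally
    ending with the daimon "†". `viewpoint_starts` tells whether the speaker
    whose viewpoint is taken played the first move.
    """
    rendered = []
    for turn, move in enumerate(moves):
        if move == "†":
            rendered.append("†")
            continue
        own_move = (turn % 2 == 0) == viewpoint_starts
        locus, ramification = move
        address = ".".join(["ξ"] + [str(i) for i in locus])
        openings = "{" + ",".join(str(i) for i in sorted(ramification)) + "}"
        rendered.append(f"({'+' if own_move else '-'},{address},{openings})")
    return " ".join(rendered)


# The FM-contract scenario: Bob opens, Alice picks the surprise, Bob gives
# the CD, and Alice closes the exchange.
moves = [((), {0}), ((0,), {1, 3}), ((0, 3), {1}), "†"]
alice_view = dispute(moves, viewpoint_starts=False)
bob_view = dispute(moves, viewpoint_starts=True)
```

The two renderings list the same loci and ramifications; only the polarities are exchanged, which is the sense in which the dispute is reported from the point of view of one speaker.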


3 Dialogues in Ludics 

In this attempt to provide dialogues with a formal frame, we shall be
interested only in the elements of the dialogue which support the interaction.
We then superpose a deconstruction of the dialogue (articulation and analysis of
the successive interventions according to the opportunities created) and a
reconstruction of strategies (whose aim is the continuation of the dialogue), on
which the dialogical interaction is based. In this context, a dialogue is the
result of an interaction between the strategies of two speakers.

The formal decomposition of dialogues will be considered at various levels of
granularity.

— For an elementary decomposition, a dialogue is seen as an alternation
of signed interventions, where an intervention means one or several successive
sentences uttered by a speaker before the turn of his addressee.

— We can then refine this approach and decompose the interventions themselves,
with regard to the way they are dynamically built.


3.1 Elementary Decomposition 

A dialogue and the interventions of each speaker are observed from the point
of view of the interaction: to which previous interventions an intervention of
one of the speakers is attached, and which openings it creates for the
continuation of the dialogue. As announced in the preliminary remarks, we do
not retain the propositional content.


Example 5. “Tomorrow the weather will be fine, I will go to work at Luminy by bike.”

What could be the answers of the addressee? With this utterance, the speaker has
opened three potential answers:

1. Are you sure? Did you consult the weather forecast?

2. Are you still working at Luminy?

3. I did not know you were so good at sport!


We are only concerned with studying the geometrical aspect of a dialogue. So
this utterance, with the three possibilities it creates, is represented by the
following design:

$$
\dfrac{\xi.1 \vdash \qquad \xi.2 \vdash \qquad \xi.3 \vdash}{\vdash \xi}
$$

Let us summarize what is taken into account in our modelling of dialogue: a
dialogue is an alternating sequence of interventions; an intervention is
anchored at a certain locus among those created by the previous interactions,
and it creates new ones; some interventions allow the dialogue to be closed.

Our formalization of dialogues is built by means of the following elements:

— An intervention of Speaker or Addressee¹⁰ is an action (ε, ξ, I), where:

• ε is a polarity: + (from the point of view of the speaker who performs
the intervention) or − (from the point of view of the one who records
the intervention);

• ξ (the focus) is the point from which the speaker either ends the
conversation or follows up on the opportunities created during the previous
exchanges;

• I (the ramification) is the set of openings created by this intervention.

— A dialogue is a sequence of alternating interventions; the story of this
alternation told by one of the speakers will be represented by a chronicle or,
in a more dynamic way, by the trace of an interaction between the strategies
of the speakers (a dispute). In the same way, in Ludics an alternating
sequence of actions can be viewed either from a static point of view (telling
about a past play) or from a dynamic point of view (taking part in the
play in progress).

— A strategy of one of the speakers will be represented by a design. In order
to build a strategy as the tree of its compulsory or offered possibilities, the
following rules are used:

• to play a positive rule / to perform an intervention;

• to play a negative rule / to record, to anticipate the interventions of the
other one;

• to play the daimon / to terminate a dialogue.
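The elements listed above can be turned into a small well-formedness check. The following Python sketch is one possible reading, not the authors' definition: the class `Intervention`, the function `check_dialogue`, the encoding of the daimon as a missing focus, and the rule that a speaker only follows up on loci opened by the other speaker are all assumptions made for the illustration.

```python
from dataclasses import dataclass
from typing import FrozenSet, Optional, Tuple

Locus = Tuple[int, ...]


@dataclass(frozen=True)
class Intervention:
    speaker: str                       # e.g. "P" or "O"
    focus: Optional[Locus]             # None encodes the daimon (closing move)
    ramification: FrozenSet[int] = frozenset()


def check_dialogue(interventions, start=()):
    """Check an alternating dialogue and return the loci left open.

    `interventions` is a list; `start` is the locus where the dialogue begins.
    """
    opened_by = {start: None}          # open locus -> speaker who created it
    previous = None
    for index, step in enumerate(interventions):
        if step.speaker == previous:
            raise ValueError("speakers must alternate")
        previous = step.speaker
        if step.focus is None:
            if index != len(interventions) - 1:
                raise ValueError("the daimon must be the last intervention")
            break
        if step.focus not in opened_by:
            raise ValueError(f"locus {step.focus} has not been opened")
        if opened_by[step.focus] == step.speaker:
            raise ValueError("a speaker follows up on the other's openings")
        del opened_by[step.focus]      # each opening is used at most once
        for i in step.ramification:
            opened_by[step.focus + (i,)] = step.speaker
    return set(opened_by)
```

A dialogue that passes the check returns the openings that were created but never taken up, which is where the conversation could still be continued.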

Example 6 (“Sales of real estate”). Let us consider the following situation and
imagine several dialogues about it:

P knows that O has three real estate properties A₁, A₂, A₃; he has heard that O
would like to sell some of his properties. P is interested in the property
A₁, so he starts a dialogue with O. P wants to know whether O intends to sell
A₁ and, if so, at what price; he does not want to show immediately
that he is interested in this purchase.

— First possible dialogue: Dial₁

P: I have heard that you would like to sell some of your real estate, which properties?
O: I intend to sell A₁ and A₂.

P: At what price are you selling A₁?

O: 100000 euros.

P: OK.

This dialogue Dial₁ can be represented as the result of the interaction
between the two following designs, the design of P (left) and the design of O
(right):

$$
\dfrac{\dfrac{\dfrac{\dfrac{\dfrac{}{\vdash \sigma.0.1.5.100,\ \sigma.0.2}\,\dagger}{\sigma.0.1.5 \vdash \sigma.0.2}}{\vdash \sigma.0.1,\ \sigma.0.2}}{\sigma.0 \vdash}}{\vdash \sigma}\ P
\qquad\qquad
\dfrac{\dfrac{\dfrac{\dfrac{\sigma.0.1.5.100 \vdash}{\vdash \sigma.0.1.5}}{\sigma.0.1 \vdash} \qquad \sigma.0.2 \vdash}{\vdash \sigma.0}}{\sigma \vdash}\ O
$$

¹⁰ To indicate the actors of a dialogue we shall use either P for Player and O
for Opponent, as in Game Theory, or Speaker and Addressee, as in Dialogue Theory

Finally, the dialogue Dial₁ is represented by the following dispute (from the
point of view of P):

(+, σ, {0})            I have heard ...
(−, σ.0, {1, 2})       A₁ and A₂
(+, σ.0.1, {5})        ... price of A₁?
(−, σ.0.1.5, {100})    100000 euros
(+, †)                 OK

Comment 1. • σ is the locus where the dialogue starts.

• σ.0 is the locus of the question I have heard ...

• σ.0.1 and σ.0.2 are the loci of the answers I sell A₁ and I sell A₂.

• σ.0.1.5 is the locus of the question what is the price of A₁?

• σ.0.1.5.100 is the locus of the answer 100000 euros.

• P decides to stop the dialogue by playing †. He could continue by asking
about the property A₂ (by focusing on σ.0.2).

— Second possible dialogue: Dial₂

P: I have heard that ...

O: I intend to sell A₂ and A₃.

P: Very well, good luck.

This dialogue Dial₂ is seen as the interaction between the two following
designs, the design of P (left) and the design of O (right):

$$
\dfrac{\dfrac{\dfrac{}{\vdash \sigma.0.2,\ \sigma.0.3}\,\dagger}{\sigma.0 \vdash}}{\vdash \sigma}\ P
\qquad\qquad
\dfrac{\dfrac{\sigma.0.2 \vdash \qquad \sigma.0.3 \vdash}{\vdash \sigma.0}}{\sigma \vdash}\ O
$$

Finally, the dialogue Dial₂ is represented by the following dispute (from the
point of view of P):

(+, σ, {0})            I have heard ...
(−, σ.0, {2, 3})       I sell A₂ and A₃
(+, †)                 Well, good luck

Comment 2. • σ is the locus where the dialogue starts.

• σ.0 is the locus of the question I have heard ...

• σ.0.2 and σ.0.3 are the loci of the answers I sell A₂ and I sell A₃.

• P is only interested in A₁, so he decides to stop the dialogue by playing †.

It is possible to represent by a design the conversation plan that P has built
to obtain the following information: does O sell A₁ and, if so, at what price?

[Figure: design of P, of base ⊢ σ. After the opening action at σ and the
negative rule at σ.0, the design has one branch per anticipated answer of O;
the legible forks are ⊢ σ.0.1; ⊢ σ.0.1, σ.0.2; ⊢ σ.0.1, σ.0.3;
⊢ σ.0.1, σ.0.2, σ.0.3; and ⊢ σ.0.3. In the branches containing σ.0.1, P
continues with the fork σ.0.1.5 ⊢, above which the forks ... ⊢ σ.0.1.5.i ...
are closed by †.]

It is really a conversation strategy: P imagines the possible answers of O to
his initial question and plans how to pursue the dialogue in each case.
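Such a plan can be sketched as a table from anticipated answers to planned reactions. The Python code below is only an illustration: it assumes that O may answer with any non-empty subset of the three properties, that P asks for the price (opening 5, as in Dial₁) whenever A₁ is for sale, and that P otherwise closes the dialogue with the daimon, as in Dial₂. The name `conversation_plan` and its parameters are hypothetical.

```python
from itertools import combinations

DAIMON = "†"


def conversation_plan(properties=(1, 2, 3), wanted=1, price_question=5):
    """P's planned reaction to every possible answer of O at locus σ.0.

    Keys are the ramifications O may play at σ.0; values are either a
    positive action (polarity, locus, ramification) or the daimon.
    """
    plan = {}
    for size in range(1, len(properties) + 1):
        for answer in combinations(properties, size):
            if wanted in answer:
                # O sells the wanted property: ask its price at σ.0.<wanted>.
                plan[frozenset(answer)] = (
                    "+", (0, wanted), frozenset({price_question})
                )
            else:
                # Nothing of interest is for sale: close the dialogue.
                plan[frozenset(answer)] = DAIMON
    return plan
```

Looking up the answer {1, 2} gives the action (+, σ.0.1, {5}) played in Dial₁, and looking up {2, 3} gives the daimon played in Dial₂.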


3.2 Refined Approach 

Until now, we have represented a dialogue as a chronicle or as a dispute. This 
modelisation seems well adapted to dialogues as long as we only consider a 
dialogue as an exchange of information. But some refinement are needed as soon 
as we are concerned in more elaborated dialogues, as controversies. 

As a first refinement, we propose that an intervention of a speaker may be
more complex than a single action: a whole design, possibly a cut-net, can be
played instead of an action. Associating a whole design or a cut-net with the
intervention of the speaker is the sign that this exchange could be broken down
into more elementary ones. We shall perform this operation in order to account
for the dynamics of these exchanges.

The example of presupposition will illustrate such an extension: an
intervention is no longer represented by actions (moves) but by whole designs
(plays). This extension will be used again in Sect. 3.3 for studying
interventions that use elements of the context of the ongoing dialogue (for
example, an intervention will be represented by means of a delocalization of a
design created in previous dialogues).

Finally, Ludics also supplies us with tools to formalize the “picking up again”
in a dialogue: each speaker can correct a past intervention and propose a new
one.

Presupposition. A presupposition is an implicit assumption about the world, or
a background belief relating to an utterance, whose truth is taken for granted
in discourse. We propose to associate an intervention containing a
presupposition with a whole chronicle instead of a single action.

Let us consider this well-known example due to Aristotle: a judge asks a young
delinquent the question Do you still beat your father? The judge asks a question
that presupposes something that has not necessarily been accepted by the young
delinquent. The judge imposes on the delinquent the following exchange:

Do you beat your father? - Yes - Do you stop beating him? 

This exchange between the judge J and the delinquent D must be represented,
according to the previous elementary formalisation, by the following
interaction, with the design of J on the left and the design of D on the right:

\[
\dfrac{\dfrac{\xi.0.1.0 \vdash}{\vdash \xi.0.1} \qquad \vdash \xi.0.2}{\xi.0 \vdash}
\qquad\qquad
\dfrac{\dfrac{\vdash \xi.0.1.0}{\xi.0.1 \vdash}}{\vdash \xi.0}
\]


But the judge's utterance Do you still beat your father? contains a
presupposition, so it cannot be represented by a single action, but only by a
whole chronicle:

\[
\begin{array}{cc}
(+, \xi, \{0\})\ (-, \xi.0, \{1\})\ (+, \xi.0.1, \{0\}) & (-, \xi.0.1.0, \{1\})\ /\ (-, \xi.0.1.0, \{0\}) \\
\text{Do you still beat your father?} & \text{yes/no} \\
\text{Judge's intervention} & \text{Delinquent's intervention}
\end{array}
\]

So J denies his addressee a branch that was due to him (the possibility of
answering No). If D agrees to answer according to this configuration (without
diverging), he is trapped: he has to record a whole chronicle and answer from
the locus $\xi.0.1.0$; he has thus implicitly answered yes to the question Do
you beat your father?
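
The trap can be made explicit by looking at which actions of the chronicle are
negative from the point of view of the judge, since these are the ones he
attributes to his addressee. The Python sketch below is an illustration only;
the tuple encoding and the names judge_intervention, replies_of_d and
imposed_on_addressee are our own assumptions.

```python
# Each action is encoded as (polarity, locus, ramification); hypothetical encoding.
judge_intervention = [
    ("+", "xi", frozenset({0})),       # Do you beat your father?
    ("-", "xi.0", frozenset({1})),     # the answer "yes", presupposed by the judge
    ("+", "xi.0.1", frozenset({0})),   # Do you stop beating him?
]

# The two replies left to the delinquent, both located on xi.0.1.0.
replies_of_d = {
    "yes": ("-", "xi.0.1.0", frozenset({1})),
    "no": ("-", "xi.0.1.0", frozenset({0})),
}


def imposed_on_addressee(intervention):
    """Negative actions inside a speaker's own intervention: answers he takes for granted."""
    return [action for action in intervention if action[0] == "-"]


for word, reply in replies_of_d.items():
    chronicle = judge_intervention + [reply]
    # Whatever D replies, the recorded chronicle contains the presupposed "yes".
    print(word, imposed_on_addressee(judge_intervention)[0] in chronicle)
```

Both replies extend the same chronicle, so neither of them lets D withdraw the
answer located on xi.0.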


Picking up again. By this we mean that, in a dialogue, a speaker can abandon
the current direction of the discussion and propose a new one in its place; in
terms of games, a player can play a new move instead of a previous one.

In linear logic, the exponential formulae were essentially introduced to make
it possible to identify various occurrences of the same formula (that is, to
perform a contraction rule). Several proposals have been put forward to
integrate this possibility into Ludics. Michele Basaldella and Claudia Faggian
suggest in [1] to handle multi-addresses (indexed addresses). These
multi-addresses give the possibility (not authorized until now) of replaying a
positive action on an already visited locus (provided that this locus is a
multi-address).

We shall not go further into the technical aspects of this notion; we simply
keep this possibility in mind and illustrate it by means of an example taken
from Schopenhauer's “Dialectica Eristica”, where it illustrates the first
stratagem.

I asserted that the Englishmen were supreme in drama. My opponent attempted 
to give an instance to the contrary, and replied that it was a well-known fact that 
in opera, they could do nothing at all. I repelled the attack by reminding him that 
dramatic art covered tragedy and comedy alone .... 

This dialogue can be represented by the following interaction between two
designs (we use an alternative presentation of interaction, which is better
suited to manipulating designs with multi-addresses):


\[
\begin{array}{lll}
\text{design of P:} & \text{design of O:} & \text{Dialogue:} \\
(+, \xi, \{1\}) & c_1 = (-, \xi, \{1\}) & \text{Englishmen are supreme ...} \\
(-, \xi.1, \{\{1\}, \{2\}\}) & (+, \xi.1, \{3\}) & \text{... useless in opera} \\
 & & \text{... tragedy and comedy alone}
\end{array}
\]

In such a case the interaction diverges: there is no negative action
corresponding to the positive one. But we may imagine that the dialogue
nevertheless continues: the speaker P is ready to receive a new attempt from O.
Such a possibility can be accounted for by replacing the first action of P,
which is $(+, \xi, \{1\})$, by an action whose focus is a multi-address,
$(+, \xi.i_1, \{1\})$.

In this way, the previous exchange can be extended and may become, for example:
the Englishmen are supreme in drama - it is a well-known fact that in
opera, they can do nothing at all - but by “dramatic art” I mean tragedy and
comedy alone - So, in this case, I agree ....

This can be represented by the following interaction:


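\[
\begin{array}{lll}
\text{design of P:} & \text{design of O:} & \text{Dialogue:} \\
(+, \xi.i_1, \{1\}) & c_1 = (-, \xi.i_1, \{1\}) & \text{Englishmen are supreme ...} \\
(-, \xi.i_1.1, \{\{1\}, \{2\}\}) & (+, \xi.i_1.1, \{3\}) & \text{... useless in opera} \\
 & & \text{but ..} \\
(+, \xi.i_2, \{1\}) & c_2 = (-, \xi.i_2, \{1\}) & \text{... tragedy and comedy alone} \\
 & \dagger & \text{So.... I agree}
\end{array}
\]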


The last intervention of P is represented by the successive actions

\[
(-, \xi.i_1.1, \{\{1\}, \{2\}\})\ (+, \xi.i_2, \{1\})
\]

which specify that P cannot receive the argument that his addressee proposed,
but that he is nevertheless ready to accept another attempt: using a
multi-address indeed enables us to play again on a locus that has already been
used.
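
The bookkeeping behind this replay can be sketched as follows. This Python
fragment is a toy model and not the definition given by Basaldella and
Faggian: the class Board and its method play_positive are hypothetical names,
and we simply assume that a positive action is accepted once on a plain locus,
and once per index on a multi-address.

```python
class Board:
    """Records the foci already used by positive actions (toy model, our assumption)."""

    def __init__(self, multi_addresses=()):
        self.multi_addresses = set(multi_addresses)
        self.used = set()

    def play_positive(self, locus, index=None):
        """Return True if a positive action may be played on this focus, and record it."""
        if locus in self.multi_addresses:
            if index is None:
                raise ValueError("a multi-address must be played with an index")
            focus = (locus, index)
        else:
            focus = (locus, None)
        if focus in self.used:
            return False       # this focus has already been visited
        self.used.add(focus)
        return True


# Plain locus: the second attempt of P on xi is refused.
plain = Board()
first_plain = plain.play_positive("xi")
second_plain = plain.play_positive("xi")

# Multi-address: xi can be replayed with a fresh index (i1, then i2).
indexed = Board(multi_addresses={"xi"})
first_indexed = indexed.play_positive("xi", index=1)
second_indexed = indexed.play_positive("xi", index=2)

print(first_plain, second_plain, first_indexed, second_indexed)
```

In this reading, the second affirmation of P is a new occurrence of the same
address rather than a forbidden return to a visited locus.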


3.3 Towards More Complex Dialogues 

Ludics seems to be a fruitful framework for dealing with more complex aspects
of dialogues. During a dialogue, the speaker builds his strategy by using
various elements: he can use pieces of former dialogues; he can use some
contextual elements... And, of course, he can use stratagems or dialectical
tricks. It is possible to describe such dialogical facts in Ludics. We rely on
the following remark: in Ludics, the designs themselves can be seen as
resulting from interactions. We already saw, in the case of presupposition,
that some interventions have to be associated with designs already built
rather than with elementary actions. We will go further by associating some
elaborate interventions with cut-nets, that is, with several interacting
designs. In particular, this will be used to simulate the fact that the
strategies supporting a dialogical interaction can be worked out by means of
designs coming from outside the dialogue in progress. We then need to assume
that some designs, in fact a set of designs (a context), are available to the
speakers when they build their interventions.

We will illustrate this possibility of dealing with more complex aspects of
dialogues by studying two examples: in the first, Ludics is applied to one of
the stratagems suggested by Schopenhauer in Art of Always Being Right; the
second illustrates the possibility of exploring in Ludics the core of
fallacious sophisms, by dealing with the petition of principle.

Study of the 4th Stratagem of Schopenhauer. We sum up the fourth
stratagem below:

If you want to draw a conclusion, you must not let it be foreseen, but you must 
get the premisses admitted one by one, unobserved, mingling them here and there 
in your talk: otherwise, your opponent will attempt all sorts of chicanery. Or, 
if it is doubtful whether your opponent will admit them, you must advance the 
premisses of these premisses; that is to say, you must draw up pro-syllogisms, 
and get the premisses of several of them admitted in no definite order. In this 
way you conceal your game until you have obtained all the admissions that are 
necessary, and so reach your goal by making a circuit. [...] 

Let us suppose the following situation: the speaker (here designated by “player”
or P, while the addressee is designated by “opponent” or O) defends a thesis A;
he wants to justify A by relying on the fact that the propositions B and C
imply A.

— Some dialogical exchanges took place. The player affirmed B, which was
accepted by O. In the same way, P affirmed C, which was also accepted by O.

That is represented as follows: the proposition B was played on an arbitrary
locus $\alpha$; O recorded this affirmation and accepted it (he gave up). The
same holds for C on an arbitrary locus $\beta$.

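The following interactions took place, each between a design of P (on the
left) and a design of O (on the right):

\[
\dfrac{\alpha.0 \vdash}{\vdash \alpha} \qquad \dfrac{\vdash \alpha.0}{\alpha \vdash}
\qquad\qquad\qquad
\dfrac{\beta.0 \vdash}{\vdash \beta} \qquad \dfrac{\vdash \beta.0}{\beta \vdash}
\]

Let us denote by $\mathcal{D}_\alpha$ and $\mathcal{D}_\beta$ the winning
designs of P based respectively on $\vdash \alpha$ and $\vdash \beta$.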

The supports of such exchanges have been recorded and will still be available
when the speaker P plays his proposition A.

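— Now we come back to the ongoing dialogue: P is asserting his thesis A,
arguing for it by means of the premises B and C, and disclosing his stratagem.
This intervention (initiating the (short) ongoing dialogue) is represented by
the following design $\mathcal{D}$, located in $\xi$:

\[
\dfrac{
\begin{array}{c} \mathcal{R}_1 \\ \xi.1 \vdash \end{array}
\qquad
\begin{array}{c} \mathcal{R}_2 \\ \xi.2 \vdash \end{array}
\qquad
\xi.3 \vdash
}{\vdash \xi}
\]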



The first action of this design is $(+, \xi, \{1, 2, 3\})$, where $\xi.1$ is the
locus of the argument B, $\xi.2$ that of C, and $\xi.3$ the locus of the
proposition $B \wedge C \Rightarrow A$.
— Let us comment on the construction of the subdesigns (normal forms of
cut-nets) $\mathcal{R}_1$ and $\mathcal{R}_2$ of $\mathcal{D}$:

• the design $\mathcal{R}_1$ is built from the winning design $\mathcal{D}_\alpha$ by using:

* one delocalisation from $\alpha$ into $\xi.1.0$ (the proposition B is
affirmed either “out of context” in $\alpha$, or in the context of the defense
of the thesis A in $\xi.1.0$);

* one shift (the proposition B affirmed in $\xi.1.0$ is used as an argument
when it is localized in $\xi.1$).

The design $\mathcal{R}_1$, based on $\xi.1 \vdash$, is the normal form of the
cut-net consisting of the interaction$^{11}$ between $\mathcal{D}_\alpha$ and
$\mathfrak{F}ax_{\alpha,\xi.1.0}$.
Then $\mathcal{R}_1 = \uparrow [[\mathcal{D}_\alpha, \mathfrak{F}ax_{\alpha,\xi.1.0}]]$;

• in the same way, $\mathcal{R}_2 = \uparrow [[\mathcal{D}_\beta, \mathfrak{F}ax_{\beta,\xi.2.0}]]$ and is based on $\xi.2 \vdash$.

P is then in a good position to win the controversy. Indeed the reaction of 
O is strongly constrained; this can be seen by looking at the interaction.
After normalization, the design corresponding to the intervention of P is the 
following: 


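\[
\dfrac{\dfrac{\dfrac{\xi.1.0.0 \vdash}{\vdash \xi.1.0}}{\xi.1 \vdash} \qquad \dfrac{\dfrac{\xi.2.0.0 \vdash}{\vdash \xi.2.0}}{\xi.2 \vdash} \qquad \xi.3 \vdash}{\vdash \xi}
\]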


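In order to converge with this intervention of P during the dialogue in
progress, O has to develop the design shown on the right below (the design of
P is recalled on the left):

\[
\dfrac{\dfrac{\dfrac{\xi.1.0.0 \vdash}{\vdash \xi.1.0}}{\xi.1 \vdash} \qquad \dfrac{\dfrac{\xi.2.0.0 \vdash}{\vdash \xi.2.0}}{\xi.2 \vdash} \qquad \xi.3 \vdash}{\vdash \xi}
\qquad\qquad
\dfrac{\dfrac{\dfrac{\dfrac{\dfrac{\vdash \xi.1.0.0,\ \xi.2.0.0,\ \xi.3}{\xi.2.0 \vdash \xi.1.0.0,\ \xi.3}}{\vdash \xi.1.0.0,\ \xi.2,\ \xi.3}}{\xi.1.0 \vdash \xi.2,\ \xi.3}}{\vdash \xi.1,\ \xi.2,\ \xi.3}}{\xi \vdash}
\]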

That is: O has to recognize that $\mathcal{R}_1$ is the shift of one
delocalisation of $\mathcal{D}_\alpha$ (and $\mathcal{R}_2$ of
$\mathcal{D}_\beta$), and has to remember that against these designs he may
only play the daimon (to stay coherent with himself).

Then it is O's turn to play. He is in the position
$\vdash \xi.1.0.0, \xi.2.0.0, \xi.3$. The only opening for O would be to play
an action located in $\xi.3$ (since on either $\xi.1$ or $\xi.2$ he can only
play the daimon); if he has nothing to oppose to the proposition
$B \wedge C \Rightarrow A$, then O accepts the thesis of P and plays the daimon.


$^{11}$ Such an interaction makes it possible to delocalize the design
$\mathcal{D}_\alpha$ from the locus where it was actually played to the current
exchange.
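
The two operations used to build the first subdesign of P can be mimicked on a
list of actions. The Python sketch below is only an illustration under our own
assumptions: loci are encoded as tuples, the functions delocalise and shift
are hypothetical helpers rather than the normalisation procedure of Ludics,
delocalisation is taken to be a renaming of address prefixes, and the shift is
taken to add a negative action of ramification {0} below the design.

```python
def delocalise(actions, source, target):
    """Rename every locus source.x into target.x (prefix substitution)."""
    renamed = []
    for polarity, locus, ramification in actions:
        if locus[:len(source)] != source:
            raise ValueError(f"locus {locus} is not below {source}")
        renamed.append((polarity, target + locus[len(source):], ramification))
    return renamed


def shift(actions, base):
    """Put a negative action (-, base, {0}) in front of a design located in base.0."""
    return [("-", base, frozenset({0}))] + list(actions)


# Winning design of P on alpha, read from the figure: the single action (+, alpha, {0}).
d_alpha = [("+", ("alpha",), frozenset({0}))]

# Move it from alpha to xi.1.0, then shift it so that it is based on xi.1.
moved = delocalise(d_alpha, ("alpha",), ("xi", 1, 0))
r_1 = shift(moved, ("xi", 1))
print(r_1)
```

The result visits xi.1 and then xi.1.0, which agrees with the left branch of
the normalized design of P.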


Petitio Principii. Our aim now is to understand “petitio principii” (or
“begging the question”) in Ludics. Rather than by an action or by an already
built design, we will represent an intervention using such a logical fallacy by
a whole cut-net (several designs linked by cuts). The loci to which the
addressee could cling should appear after the normalisation of such a cut-net,
but these loci are in fact not available:

— either because these places are pushed back to infinity. This is the case
when the logical fallacy is due to circular reasoning (traditional usage of
petitio principii);

— or because these places are pinched. This is the case when “begging the
question” consists in imposing the premise of a thesis as if it were commonly
admitted, instead of offering it for discussion; that is, in presenting
evidence (in support of a conclusion) that is less likely to be accepted than
merely asserting the conclusion (contemporary usage of petitio principii).

We illustrate the foregoing claims with two examples corresponding to the
two cases of petitio principii mentioned above.

— Let us first consider the following utterance: The soul is immortal because
it never dies. This is a traditional usage of “begging the question”: circular
reasoning. Indeed, the affirmation the soul is immortal is justified by another
one, it never dies, which has the same meaning.

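We propose to formalize the utterance The soul is immortal because it never
dies by the following (recursively defined) design:

\[
\mathcal{D}_\xi \;=\; \dfrac{\dfrac{\begin{array}{c} [[\mathcal{D}_\xi, \mathfrak{F}ax_{\xi,\xi.1.1}]] \\ \vdash \xi.1.1 \end{array}}{\xi.1 \vdash}}{\vdash \xi}
\]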

Let us comment on how such a recursive design may be associated with the
utterance The soul is immortal because it never dies.

This utterance contains an affirmation, the soul is immortal, and explicitly
contains its argument, the soul never dies; it is then reasonable to assume
that this intervention does not reduce to a single action but is an already
built design $\mathcal{D}_\xi$. The first action, $(+, \xi, \{1\})$, indicates
that there is an affirmation (arbitrarily located on $\xi$). The second one,
$(-, \xi.1, \{1\})$, expresses the fact that the speaker is ready to support
his argument the soul never dies. The design $\mathcal{D}_\xi$ also has to
contain the defense of the soul never dies (which is nothing but because it is
immortal). Therefore, the subdesign above $\vdash \xi.1.1$ has the same content
as $\mathcal{D}_\xi$, except that the loci of the two affirmations are
exchanged. This is expressed by the delocalization of the design
$\mathcal{D}_\xi$ from $\xi$ into $\xi.1.1$. Technically speaking, the design
above $\vdash \xi.1.1$ is
$\mathcal{D}_{\xi.1.1} = [[\mathcal{D}_\xi, \mathfrak{F}ax_{\xi,\xi.1.1}]]$.

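The resulting design is infinite; the loci to which the addressee could cling
are never available:

\[
\mathcal{D}_\xi \;=\; \dfrac{\dfrac{\dfrac{\dfrac{\dfrac{\dfrac{\vdots}{\xi.1.1.1.1.1 \vdash}}{\vdash \xi.1.1.1.1}}{\xi.1.1.1 \vdash}}{\vdash \xi.1.1}}{\xi.1 \vdash}}{\vdash \xi}
\]

where the subdesigns based on $\vdash \xi.1.1$ and $\vdash \xi.1.1.1.1$ are
$\mathcal{D}_{\xi.1.1}$ and $\mathcal{D}_{\xi.1.1.1.1}$ respectively.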
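
The recursive definition can be unfolded lazily, which makes the infinite
regress visible. The following Python generator is a sketch under our own
encoding (loci as tuples, the name petitio is hypothetical): it produces the
actions of the design one after the other and never terminates by itself.

```python
from itertools import islice


def petitio(locus=("xi",)):
    """Lazily unfold the circular design: an affirmation, its support, and again."""
    while True:
        yield ("+", locus, frozenset({1}))          # affirmation located on the locus
        yield ("-", locus + (1,), frozenset({1}))   # readiness to support the argument
        locus = locus + (1, 1)                      # delocalisation from xi into xi.1.1


# Only a finite prefix can ever be inspected.
prefix = list(islice(petitio(), 6))
for polarity, locus, _ in prefix:
    print(polarity, ".".join(str(part) for part in locus))
```

However long the inspected prefix is, it only ever contains the same two kinds
of action, on ever deeper loci.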

— Let us now consider another case of the petition of principle: the one
consisting in imposing one of the premises of an affirmation as if it were
commonly admitted. An intervention using such a petition of principle may
be presented by the following schema: Since A (which should have been justified
but which is taken for granted) and since A implies B, you will agree about B.

\[
\dfrac{\dfrac{\dfrac{}{\vdash \xi.1.1}\,(\xi.1.1, \emptyset)}{\xi.1 \vdash} \qquad \xi.2 \vdash}{\vdash \xi}
\]

where $\xi$ is the locus where B is affirmed, and $\xi.1$ and $\xi.2$ are the
loci of the premises of B (respectively A and $A \Rightarrow B$). The subdesign
corresponding to the justification of A is the design:

\[
\dfrac{}{\vdash \xi.1.1}\,\emptyset
\]

That is: A appears here as a given, not needing to be justified. The
affirmation of such a given is then an action $(+, \xi.1.1, \emptyset)$. The set
of loci from which a speaker could continue the investigation of A is empty.

Once again the loci to which the addressee could cling are not available. 
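
That no locus is left to the addressee follows from the empty ramification. As
a small illustration (the helper opened_loci and the tuple encoding are our own
assumptions), the sub-loci created by an action are obtained by extending its
focus with each element of its ramification:

```python
def opened_loci(focus, ramification):
    """Loci created by an action: the focus extended by each index of the ramification."""
    return {focus + (index,) for index in ramification}


# First action of the design: B is affirmed on xi with the premises on xi.1 and xi.2.
premises = opened_loci(("xi",), frozenset({1, 2}))

# A is asserted as a given: the ramification is empty, so nothing can be questioned.
after_given = opened_loci(("xi", 1, 1), frozenset())

print(sorted(premises), after_given)
```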

4 Conclusion 

The model we have proposed has two essential features. The descriptive one
lies in the fact that we retrieve the simple form of communicative exchange in
the basic structure of formal dialogues. The prospective one commits us to
observing complex processes inside dialogues.





Evidently, being descriptive is required by any genuine notion of model. But,
thanks to this double feature, we obtain a multi-scale theory, which makes it
possible to view objects at different levels of granularity, depending on what
we want to study in them. If our focus is on strategic remodelling processes
upon arguments, we can make use of complex operations inside dialogue plans. If
we just want to describe the chronology of actions, taking into account the
fact that agents anticipate the plays of their opponents, we do not need more
than the elementary decomposition level. So the model ensures continuity
between fine, refined and complex levels, without any change of formal
background, just by going deeper beneath the surface structure of objects.

We must observe that this work opens the way to many other formalisations
and conceptual problems. It takes a very important place in our program, which
consists in investigating the possibilities offered by the geometrical turn in
furnishing and deploying mathematical means for human and social sciences. The
formalisation of dialogues holds a prominent position in this project, because
the notion is intrinsically connected to epistemic, semiotic, pragmatic and
semantic layers.